Is it better coding practice to define variables outside a foreach even though more verbose?

Question

In the following examples:

the first seems more verbose but less wasteful of resources
the second is less verbose but more wasteful of resources (redefines string each loop)

Which is better coding practice?

First example:

using System;
using System.Collections.Generic;

namespace TestForeach23434
{
    class Program
    {
        static void Main(string[] args)
        {
            List<string> names = new List<string> { "one", "two", "two", "three", "four", "four" };

            string test1 = "";
            string test2 = "";
            string test3 = "";
            foreach (var name in names)
            {
                test1 = name + "1";
                test2 = name + "2";
                test3 = name + "3";
                Console.WriteLine("{0}, {1}, {2}", test1, test2, test3);
            }
            Console.ReadLine();
        }
    }
}

Second example:

using System;
using System.Collections.Generic;

namespace TestForeach23434
{
    class Program
    {
        static void Main(string[] args)
        {
            List<string> names = new List<string> { "one", "two", "two", "three", "four", "four" };

            foreach (var name in names)
            {
                string test1 = name + "1";
                string test2 = name + "2";
                string test3 = name + "3";
                Console.WriteLine("{0}, {1}, {2}", test1, test2, test3);
            }
            Console.ReadLine();
        }
    }
}

What do you mean, "wasteful"? The first one seems more wasteful, since it initializes the variables to values that will never be used. — John Saunders, Mar 05 '10 at 17:25
in terms of resources, e.g. in the sense that it is better coding practice to use a StringBuilder than constantly adding to a string. — Edward Tanguay, Mar 05 '10 at 17:26
I am interested in this answer too. I am interested because, in the second example, the code seems more concise, but objects get created and destroyed each time in the loop. — Aishwar, Mar 05 '10 at 17:28
I think you are talking apples and oranges... stringbuilder is more effective because it is a mutable object. Wheras String is immutable. Any modifications you make to a string result in a brand new object being created. — Josh, Mar 05 '10 at 17:28
@aip.cd.aish - The exact same objects are being created in both examples. Only the variable lifetimes are different. — Jeffrey L Whitledge, Mar 05 '10 at 17:42
@Jefferey: Yes, but since it is inside the loop, don't the 3 strings get created each time in the beginning of the loop and garbage collected at the end? Since they are continuously being GC'd, would this make your app slower (not specifically in this example, but as a general statement) — Aishwar, Mar 05 '10 at 17:51
@aip.cd.aish - No. If the string are ever GC'd, it will occur at some indeterminant point in the future, which is likely to be the same in both versions. The strings are not GC at the end of the loop. — Jeffrey L Whitledge, Mar 05 '10 at 18:00
I don't see how StringBuilder would help in this situation. I thought that for small number concatenations string was fastest. — Mike Polen, Mar 05 '10 at 18:12
@Jeffrey: Ah, I see. I think I see why I am confused (It's becase of the immutability of strings in the above example). If instead of the above, I had some code like this in the loop `Object a = new Object();`, it would be worse to do that in the loop, if I can create it once out of the loop and just modify the values in the loop - since if it was in the loop, objects are really being created and destroyed in the loop. — Aishwar, Mar 05 '10 at 18:13
@aip.cd.aish - That is correct (except that objects are not destroyed in the loop--only created). It relates to the immutability of strings only in the sense that the + operator in the above code actually does create a new instance of a string object, which may not be obvious. — Jeffrey L Whitledge, Mar 05 '10 at 18:21
@Mike Polen - You are correct that StringBuilder will not help in this particular situation, but not necessarily because of the number of concatenations. It is because, in this example, there are no intermediate strings that are used only in the concatinations. In this example, every string that is created gets passed to WriteLine, where it is an honest-to-goodness immutable string, even if StringBuilder had been used to create it. A better optimization would be the one propsed by mmyers below, since it doesn't create any new strings. — Jeffrey L Whitledge, Mar 05 '10 at 18:40
What "resources" are you speaking of? I don't understand what you mean by "resources", so it is hard to answer the question about "resources". The question about style is clear: place the local declaration such that the *scope* of the declared variable is the *lexically smallest* scope that guarantees that the variable's contents have the *correct lifetime*. — Eric Lippert, Mar 05 '10 at 20:44
I mean resources in the sense of "how much memory it allocates" — Edward Tanguay, Mar 09 '10 at 07:51

score 47 · Accepted Answer · answered Mar 05 '10 at 17:28

47

The second form is no more wasteful - it's simply better.

There's no advantage to declaring the variables outside the loop, unless you want to maintain their values between iterations.

(Note that usually this makes no behavioural difference, but that's not true if the variables are being captured by a lambda expression or anonymous method.)

answered Mar 05 '10 at 17:28

Jon Skeet

1,421,763
867
9,128
9,194

4

but if my foreach loop interates a million times, don't I take a performance hit by defining 3 million strings instead of 3 strings? – Edward Tanguay Mar 05 '10 at 17:29
17

You are still creating three million strings because String is an immutable type. When you do string + "1" you are effectively creating a brand new String object and assigning it to something, not modifying the original. – Josh Mar 05 '10 at 17:31
3

You get the same hit either way. If performance is the issue, then use a StringBuilder which is mutable and is actually reused. – Mar 05 '10 at 17:31
3

@Edwad Tanguay, behind the scene, space for a variable is only created once for the function, and reused for every iteration. Even more exact, the underlying layer has no notion of scoping and loops besides function and class scope. And string concatenation in a loop is never good, use StringBuilder as those above told. – Dykam Mar 05 '10 at 17:45
1

But more generally, Jon, for other object types, the addresses would be reused (in case B), correct? Even if the objects are modified within the loop. – harpo Mar 05 '10 at 18:25

score 18 · Answer 2 · answered Mar 05 '10 at 17:29

18

Personally, I think it's the best practice to declare variables in the tightest scope possible, given their usage.

This provides many benefits:

It's easier for refactoring, since extracting a method is simpler when the variables are already in the same scope.
The variable usage is more clear, which will lead to more reliable code.

The only (potential) disadvantage would be the extra variable declaration - however, the JIT tends to optimize this issue away, so it's one I wouldn't necessarily worry about in real work.

The one exception to this:

If your variable is going to be adding a lot of GC pressure, and if this can be avoided by reusing the same object instance through the foreach/for loop, and if the GC pressure is causing measured performance problems, I'd lift it into the outer scope.

answered Mar 05 '10 at 17:29

Reed Copsey

554,122
78
1,158
1,373

You had me right up to that last paragraph. Object lifetime is an entirely different question from variable lifetime. – Jeffrey L Whitledge Mar 05 '10 at 17:37
@Jeffrey: Yes - and I wasn't talking about his specific example here, where it makes no (technical) difference. However, I was trying to say there are times when moving a variable into an outer scope can be a valid means of changing the object lifetime, to prevent adding unneeded system pressure due to repeated allocations. – Reed Copsey Mar 05 '10 at 17:42
1

If a variable initialization allocates a new object on the heap, then moving that outside a loop could create fewer objects, but that is not the same as moving the variable declaration, or changing the variable's scope. There is a related issue with regards to GC pressure and variable scope, but it has the opposite effect than what you describe. A variable declared outside the loop has an expanded scope that may increase an object's lifetime unless the static analysis notes that the variable is not used after the loop completes. – Jeffrey L Whitledge Mar 05 '10 at 17:56
@Jeffrey: That was specifically why I included, in that statement: "can be avoided by reusing the same object instance through ..." I was making a general statement, in regards to my initial statement where I said to "declare variables in the tightest scope possible" - sometimes, moving to a larger scope means you can reuse a variable without reinitializing it, which can prevent multiple allocations from occurring (at the cost of having the object lifetime lengthened). – Reed Copsey Mar 05 '10 at 18:06

womp · Answer 3 · 2010-03-05T23:37:35.197

5

Those are both wasteful and verbose.

foreach (var name in names)
{
   Console.WriteLine("{0}1, {0}2, {0}3", name);
}

.

</tongueincheek>

edited Mar 05 '10 at 23:37

answered Mar 05 '10 at 17:27

womp

115,835
26
236
269

23

Actually, `Console.WriteLine("{0}1, {0}2, {0}3", name)` would be my preference. – Michael Myers Mar 05 '10 at 17:35

score 2 · Answer 4 · answered Mar 05 '10 at 17:32

2

Depending on language and compiler it may or may not be the same. For C# I expect the resulting code to be very similar.

My own philosophy on this is simple:

Optimize for ease of understanding.

Anything else is premature optimization! The biggest bottleneck in most development is time and attention of the developer. If you absolutely must squeeze out every last CPU cycle then by all means do so, but unless you have a compelling business need to do or are writing a critical component (common library, operating system kernel, etc) you are better off waiting until you can benchmark the finished program. At that time optimizing a few of the most costly routines is justified, before then it's almost certainly a waste of time.

answered Mar 05 '10 at 17:32

sorpigal

25,504
8
57
75

I agree with your statement, but when dealing with a loop you much consider the performance impact of your code. There is another question here, where re-initializing several variables in nested loop is taking thirty seconds. – AMissico Mar 05 '10 at 17:34
1

While I agree with the general point, I'd argue that optimizing for correctness is right up there with ease of understanding. So I'd ask "which one is easier to read" and "which one is less likely to cause weird side effects when modified in the future" well before "which is faster". – kyoryu Mar 05 '10 at 23:55

score 1 · Answer 5 · answered Mar 05 '10 at 17:29

1

I'm not sure what you gain by defining the string variable outside of the loop. Strings are immutable so they are not reused. Whenever you assign to them, a new instance is created.

answered Mar 05 '10 at 17:29

1

Interned strings are re-used -- of course, the calculated strings in this example will not be interned by default, so I take your point. – Eric Lippert Mar 05 '10 at 20:49

score 0 · Answer 6 · answered Mar 05 '10 at 17:29

I think it depends on what you are trying to solve. I like the second example, because you can move the code in one step. I like the first example, because it is faster due to less stack manipulations, less memory fragmentation, and less object construction/creation.

score 0 · Answer 7 · answered Mar 05 '10 at 17:30

0

I've found that 'hoisting' declarations out of loops is typically a better long-term strategy for maintenance. The compiler will usually sort things out acceptably for performance.

answered Mar 05 '10 at 17:30

Paul Nathan

39,638
28
112
212

score 0 · Answer 8 · answered Mar 05 '10 at 17:36

They are both virtually the same as far as performance goes (strings are immutable), but as for readability... I'd say neither is very good. You could easily just do it all within the Console.WriteLine.

Perhaps you can post the real problem instead of an example?

score 0 · Answer 9 · answered Mar 05 '10 at 18:22

0

I generally declare variables as close to their use as scoping allows, which in this case would be your second example. Resharper tends to encourage this style as well.

answered Mar 05 '10 at 18:22

heisenberg

9,665
1
30
38

score 0 · Answer 10 · answered Mar 05 '10 at 19:38

For POD type data declare closest to first use. For anything like a class that does any memory allocation, then you should consider declaring those outside of any loops. Strings will almost certainly do some form of allocation and most implementations (at least in C++) will attempt to reuse memory if possible. Heap based allocations can be very slow indeed.

I once profiled a bit of C++ code that included a class that new'd data in its ctor. With the variable declared outside of the loop it ran 17% faster than with the variable declared inside the loop. YMMV in C# so profile performance you may be very surprised at the results.

score 0 · Answer 11 · answered Oct 21 '16 at 11:27

0

This is my favourite part of Linq which I guess it fits here:

names.ForEach(x => Console.WriteLine("{0}1, {0}2, {0}3", x));

answered Oct 21 '16 at 11:27

meJustAndrew

6,011
8
50
76

score -1 · Answer 12 · answered Mar 05 '10 at 17:31

-1

Follow a simple rule while declaring variables

Declare it when you need it for the first time

answered Mar 05 '10 at 17:31

Anantha Kumaran

10,101
8
33
36

3

Unless you're using C, pre-C99. – Michael Myers Mar 05 '10 at 17:34
3

I have found that using that principle is fine until you have larger-scale software: having a centralized place to find your variable decls becomes useful in maintenance. – Paul Nathan Mar 05 '10 at 17:37
6

@Paul Nathan - If you are having trouble finding where a variable is declared in a method, then your method is way, way too long! – Jeffrey L Whitledge Mar 05 '10 at 18:05
@Jeffrey: I think Paul's point stands when you're talking about private member variables in a class. – Jacob G Mar 05 '10 at 18:07
@Jeffery: Probably, but we're not always given the choice to trash code handed to us and rewrite. IMO, better to push variables to the top of the routine as a standard. – Paul Nathan Mar 05 '10 at 18:12
1

@Paul I don't see how that offers any value, IMO that actually lowers debuggability because you might have to watch the variable from line 1 of the method instead just 3 lines before the end of the method. And I feel sorry for you about not being able to fix bad code I worked at a position like that I'm glad I'm in charge now at my current one and I always subscribe to fix things when you touch them. – Chris Marisic Mar 05 '10 at 18:26
@Chirs Marisic - Yes, fix it when you touch it! I used to add functionality to a piece of legacy code in which the method `OrderPost` was over 4000 lines long. I usually figured I could spend a week making and debugging the change, or I could spend three days refactoring and an hour making the change. It was an easy choice. – Jeffrey L Whitledge Mar 05 '10 at 18:32
@Chris: Well, it was a really gnarly research program with math I didn't fully understand, etc, etc. Simpler to decl at the top. :) – Paul Nathan Mar 05 '10 at 18:59
Sorry but in my opinion that is not a good rule to follow. Because when the thing you are building gets just a little complicated, you will, get confused. – Moulde Mar 05 '10 at 20:09

Is it better coding practice to define variables outside a foreach even though more verbose?

12 Answers12

Declare it when you need it for the first time

Linked