11

I recently learned that an integer takes 4 bytes of memory.

First I ran this code and measured the memory usage:

int main()
{
   int *pointer;
}


  • It took 144KB.

Then I modified the code to allocate 1000 integer variables.

int main()
{
   int *pointer;

   for (int n=0; n < 1000; n++)
     { 
       pointer = new int ; 
     }
}


  • Then it took (168-144=) 24KB,
    but 1000 integers are supposed to occupy only (4 bytes x 1000 =) 3.9KB.

Then I decided to make 262,144 integer variables which should consume 1MB of memory.

int main()
{
   int *pointer;

   for (int n=0; n < 262144; n++)
     { 
       pointer = new int ; 
     }
}

Surprisingly, it now took 8MB.


Memory usage grows exponentially with respect to the number of integers.
Why is this happening?

I'm on Kubuntu 13.04 (amd64)
Please give me a little explanation. Thanks!

NOTE: sizeof(int) returns 4

animuson
  • 53,861
  • 28
  • 137
  • 147
Naveen
  • 623
  • 9
  • 20
  • For fun, try replacing `int` with `long double` and compare. – Kerrek SB May 18 '13 at 12:53
  • Btw, an `int` need not be 4 bytes long. On 32-bit Intel CPUs (overly popular nowadays) it is, but nothing requires it to always be 4 bytes. –  May 18 '13 at 12:53
  • Try printing the value of `pointer` each time through the loop. – Barmar May 18 '13 at 12:58
  • 1
    Try new int[ 262144 ] instead. – brian beuning May 18 '13 at 13:00
  • 5
    I think the real question is why are you fudging around with CPP before learning how memory gets allocated... – TC1 May 18 '13 at 13:38
  • What my university lecturer told me is that it takes some memory from the RAM when you make variables. That's all. The lesson was over. It's the nature of where I live :(. – Naveen May 18 '13 at 14:03
  • This isn't what "exponentially" means. This is an example of an exponential: 1, 3, 9, 27, 81, 243, 729, .... If it had actually scaled exponentially, the smallest number of bytes your 262,144 integers could possibly have taken was 53,157,183,946,379,858,456 KB! (Here, the exponent would be ~0.000154305.) – Rex Kerr May 18 '13 at 20:08
  • Sorry, I meant it the other way... The instantaneous rate of change of memory usage increases. – Naveen May 19 '13 at 07:54

9 Answers

15

Memory for individually allocated dynamic objects is not required to be contiguous. In fact, due to the alignment requirements for new char[N] (namely to be aligned at alignof(std::max_align_t), which is usually 16), the standard memory allocator might just never bother to return anything but 16-byte aligned memory. So each int allocation actually consumes (at least) 16 bytes. (And further memory may be required by the allocator for internal bookkeeping.)

The moral is of course that you should be using std::vector<int>(1000000) to get a sensible handle on one million dynamic integers.
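For illustration, here is a minimal sketch of that suggestion, using the 262,144 elements from the question (the exact memory figures will still depend on your allocator):

#include <iostream>
#include <vector>

int main()
{
    // One contiguous allocation holds all the elements, so the
    // allocator's bookkeeping and alignment overhead is paid once,
    // not once per int.
    std::vector<int> numbers(262144, 0);

    std::cout << "payload: " << numbers.size() * sizeof(int)
              << " bytes\n";   // ~1 MiB of int data
}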

Kerrek SB
  • 464,522
  • 92
  • 875
  • 1,084
  • That explains why the multiplier is more than 4N, but not why it's not proportional to N. – Barmar May 18 '13 at 12:56
  • 4
    @Barmar: I don't believe that that's actually the case. The OP did like two experiments and claims this proves something. – Kerrek SB May 18 '13 at 12:59
  • 1
    Probably, an `int` allocation consumes 32 bytes. For `malloc` with glibc, you get two words of bookkeeping before the allocated area, iirc, so the smallest area that `malloc` ever uses is 32 bytes, I would be surprised if `new` was wildly different. `2^18 ints * 32 bytes/int = 8 MB`. – Daniel Fischer May 18 '13 at 13:14
  • Then why does sizeof(int) return 4? – Naveen May 18 '13 at 13:20
  • 2
    @Naveen Because the distance between two `int`s in an array of `int`s is 4. – Angew is no longer proud of SO May 18 '13 at 13:27
  • 4
    @Naveen Because an `int` needs 4 bytes of storage. But with `new`, you don't get just the memory you need for the `int`, but also allocation overhead. Somewhere, the size of the allocated memory needs to be stored, so that the implementation knows how much to `delete []` for example. So you get bookkeeping+alignment overhead _per allocation_. If you allocate `N` `int`s in one go, you get the overhead only once. – Daniel Fischer May 18 '13 at 13:29
  • 3
    @Naveen: Don't confuse *memory* and *objects*. It's like confusing flour and apple pie. You need one to get the other, but they feel very different when thrown into your face. – Kerrek SB May 18 '13 at 13:37
  • So, each time I dynamically allocate an integer, the memory allocator has to store information about the allocated memory, which takes some memory too. Additionally, 16 bytes are generally used to allocate my integer, even if I use less than 16 bytes. So that's what takes up so much memory... Please tell me if I'm wrong.. I've never heard about **alignof(std::max_align_t)** or the **vector(1000000)** stuff.. – Naveen May 18 '13 at 13:51
  • 1
    @Naveen : See my answer. I added "a little more" detail on that :) – Pixelchemist May 18 '13 at 13:57
  • 1
    @Naveen: Objects need to be constructed at addresses that are suitably aligned, and the maximal alignment that's required to be supported is the one I said. So the allocator can just play it safe and always allocate with that alignment, and so you can take any memory address returned from the allocator to construct an object at that place. – Kerrek SB May 18 '13 at 13:57
  • @KerrekSB : I'm just at it :) Your answer is (as far as I can tell from my tests, using no optimization in MSVC) partially incorrect. A single int allocation will require at least 32 and not 16 bytes since the allocation information is stored in addition (at least without any compiler optimization). – Pixelchemist May 18 '13 at 14:13
  • @Pixelchemist: I suppose, though that would depend on the details of the allocator. I was speaking only about what's given by the C++ language. I added a note, though. – Kerrek SB May 18 '13 at 14:16
  • @KerrekSB : You're right but I think this is very likely to happen so you should (as you now do) at least mention it. I'll give you an upvote on that now :) – Pixelchemist May 18 '13 at 14:19
10

Non-optimized allocations in common allocators come with some overhead. You can think of two "blocks": an INFO block and a STORAGE block. The INFO block will most likely sit right in front of your STORAGE block.

So if you allocate, you'll have something like this in your memory:

        Memory that is actually accessible
        vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
--------------------------------------------
|  INFO |  STORAGE                         |
--------------------------------------------
^^^^^^^^^
Some information on the size of the "STORAGE" chunk etc.

Additionally, the block will be aligned to a certain granularity (something like 16 bytes in the case of int).

I'll describe how this looks on MSVC12, since that is what I can test on at the moment.

Let's have a look at our memory. The arrows indicate 16 byte boundaries.

(Figure: memory layout in the case of an int allocation; each block represents 4 bytes.)

If you allocate a single 4-byte integer, you'll get 4 bytes of memory at a certain 16-byte boundary (the orange square after the second boundary). The 16 bytes preceding this block (the blue ones) are occupied to store additional information. (I'll skip things like endianness etc. here, but keep in mind that they can affect this sort of layout.) If you read the first four bytes of this 16-byte block in front of your allocated memory, you'll find the number of allocated bytes.

If you now allocate a second 4-byte integer (green box), its position will be at least two 16-byte boundaries away, since the INFO block (yellow/red) must fit in front of it, which is not possible at the very next boundary. The red block is again the one that contains the number of bytes.

As you can easily see: if the green block had been 16 bytes earlier, the red and the orange blocks would overlap - impossible.

You can check that for yourself. I am using MSVC 2012 and this worked for me:

char * mem = new char[4096];
cout << "Number of allocated bytes for mem is: " << *(unsigned int*)(mem-16) << endl;
delete [] mem;


double * dmem = new double[4096];
cout << "Number of allocated bytes for dmem is: " << *(unsigned int*)(((char*)dmem)-16) << endl;
delete [] dmem;

prints

Number of allocated bytes for mem is: 4096
Number of allocated bytes for dmem is: 32768

And that is perfectly correct. Therefore, in the case of MSVC12, a memory allocation using new carries an additional "INFO" block that is at least 16 bytes in size.

Pixelchemist
  • 24,090
  • 7
  • 47
  • 71
4

You are allocating multiple dynamic variables. Each variable contains 4 bytes of data, but a memory manager usually stores some additional information about allocated blocks, and each block must be aligned, which creates additional overhead.

Try pointer = new int[262144]; and see the difference.
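A sketch of what that change looks like (assuming a 4-byte int, so the block holds about 1MB of payload):

int main()
{
    // A single allocation: the allocator's bookkeeping and alignment
    // overhead is paid once for the whole block.
    int *pointer = new int[262144];

    // ... use the array ...

    delete[] pointer;
}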

nullptr
  • 11,008
  • 1
  • 23
  • 18
2

Each time you allocate an int:

  • The memory allocator must give you a 16-byte aligned block of space, because, in general, memory allocation must provide suitable alignment so that the memory can be used for any purpose. Because of this, each allocation typically returns at least 16 bytes, even if less was requested. (The alignment requirements may vary from system to system. And it is conceivable that small allocations could be optimized to use less space. However, experienced programmers know to avoid making many small allocations.)
  • The memory allocator must use some memory to remember how much space was allocated, so that it knows how much there is when free is called. (This might be optimized by combining knowledge of the space used with the delete operator. However, the general memory allocation routines are typically separate from the compiler’s new and delete code.)
  • The memory allocator must use some memory for data structures to organize information about blocks of memory that are allocated and freed. Perhaps these data structures require O(n•log n) space, so that overhead grows when there are many small allocations.

Another possible effect is that, as memory use grows, the allocator requests and initializes larger chunks from the system. Perhaps the first time you use the initial pool of memory, the allocator requests 16 more pages. The next time, it requests 32. The next time, 64. We do not know how much of the memory that the memory allocator has requested from the system has actually been used to satisfy your requests for int objects.

Do not dynamically allocate many small objects. Use an array instead.
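To see the per-allocation footprint directly, you can print the spacing between successive allocations. A minimal sketch (the gaps it prints are allocator-dependent; on common implementations they are typically 16 or 32 bytes rather than 4):

#include <cstdint>
#include <iostream>

int main()
{
    // Allocate a few ints one at a time (leaking them, like the code in
    // the question) and print the distance between successive addresses.
    // That distance is the whole per-allocation footprint, not sizeof(int).
    int *prev = new int;
    for (int n = 0; n < 5; ++n)
    {
        int *curr = new int;
        std::cout << reinterpret_cast<std::intptr_t>(curr) -
                     reinterpret_cast<std::intptr_t>(prev)
                  << " bytes apart\n";
        prev = curr;
    }
}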

Eric Postpischil
  • 195,579
  • 13
  • 168
  • 312
1

I think it depends on how the compiler creates the output program.

The memory usage of a program includes all of its sections (like .text, which contains the program's machine code), so it takes up some memory as soon as it is loaded.

And as for the variables, the memory you allocate isn't really contiguous (because of memory alignment), so it can take more memory than you would expect.

afsantos
  • 5,178
  • 4
  • 30
  • 54
KokaKiwi
  • 597
  • 6
  • 14
1

Two explanations:

  1. Dynamic memory allocation (on the heap) is not necessarily contiguous. When using new you perform dynamic allocation.
  2. If you are including debug symbols (-g compiler flag) your memory usage may be larger than expected.
Marc Claesen
  • 16,778
  • 6
  • 27
  • 62
  • Debugging symbols shouldn't be dependent on the number of iterations of a loop. – Barmar May 18 '13 at 12:55
  • That's true, but they can make programs larger than expected. This is particularly true for the smaller tests, as you pointed out. – Marc Claesen May 18 '13 at 12:58
  • But the question is why the memory grows disproportionately with the number of integers he's allocating. – Barmar May 18 '13 at 12:59
1

Each allocation creates a new variable placed according to the compiler's alignment options, which requires space between variables (the starting address of a variable should be a multiple of 128, 64, or 32 bits), and this causes the memory inefficiency of many separate variables versus one array. An array is much more useful for getting one contiguous area.
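As an illustration of the contiguity point, the elements of one array sit exactly sizeof(int) apart (a small sketch; the actual addresses will of course differ on your machine):

#include <iostream>

int main()
{
    int *arr = new int[4];   // array elements are guaranteed to be contiguous

    for (int n = 0; n < 4; ++n)
        std::cout << static_cast<void*>(&arr[n]) << '\n';   // successive addresses differ by sizeof(int)

    delete[] arr;
}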

huseyin tugrul buyukisik
  • 11,469
  • 4
  • 45
  • 97
1

You are allocating more than just an int; you are also allocating a heap block, which has overhead (and that overhead varies by platform). Something needs to keep track of the heap information. If you instead allocated an array of ints, you'd see memory usage more in line with your expectations.

Dithermaster
  • 6,223
  • 1
  • 12
  • 20
1

In addition to the alignment and overhead issues mentioned in the other answers, this may be due to the way the C++ runtime requests memory for the process from the OS.

When the process's data section fills up, the runtime has to get more memory allocated to the process. It might not do this in equal-sized chunks. A possible strategy is that each time it requests memory, it increases the amount that it requests (maybe doubling the heap size each time). This strategy keeps memory allocation small for programs that don't use much memory, but reduces the number of times that a large application has to request new allocations.

Try running your program under strace and look for calls to brk, and note how large the request is each time.

Barmar
  • 741,623
  • 53
  • 500
  • 612