heap management

Question

I know there is a meta-data which stores auxiliary information which is used during free(), realloc() when we supply only the pointer.

I have few doubts about heap.

Stack is allocated per-process. There is no doubt in that, but not sure about heap. Whether a heap information is globally maintained else per-process there will be some mechanism which holds info about allocated memory for that particular process.
How heap information will be maintained? I guess hashing mechanism. I googled and tried SO also. most of them explained it as implementation specific.. like that.

Since you're using 'process/thread' to mean the same thing: A process is different from a thread. [While each thread has its own stack, threads share memory, including the heap](http://stackoverflow.com/q/1762418/274261). Processes don't. — ArjunShankar, Jul 16 '12 at 09:30
k @ArjunShankar. leave thread. please consider process. my objective is to know how heap is done. — Jeyaram, Jul 16 '12 at 09:41
Alright. I suggest you consider editing your question then. Already one answer has been written because of the confusion caused by the use of 'thread/process'. — ArjunShankar, Jul 16 '12 at 09:51

Damon · Accepted Answer · 2012-07-16T10:15:14.880

The heap, like the stack, is a per-process, and an (almost) purely user-space thing.

The heap manager uses the sbrk syscall to inform the operating system that it intends to grow the amount of memory needed. This does little apart from change a range of pages from "not known" to "existing, zero, never accessed" (this means in reality that they still don't exist, but the OS pretends they do). When a page is first accessed, it faults, and the OS pulls a zero page from the zero pool.
(It can be slightly more complex, since the the heap manager might also shrink the data segment if a lot of memory has been released from the top, but basically it's as simple as that).

That's already all the OS knows about the heap. Everything else, such as splitting blocks, putting freed blocks onto a list or similar structure, and reusing blocks happens inside the heap manager, which is directly or indirectly (e.g. as part of the glibc) part of your program.

What exactly the heap manager is doing is implementation dependent, there exist at least half a dozen well-known different malloc implementations which work in different ways. See for example this one or this one or this one.

The stack works in the same or a similar way. A certain memory range is initially "reserved" without actually reserving anything (that is, without creating pages). A few pages are committed (that is, created) and the last page is either write protected or non-existent and this is remembered in a special way. When the stack grows so this last page is touched, a fault happens. A new page is then pulled from the zero pool, provided that the stack still is within its allowable size.

When the process terminates, all references to those pages are removed and (assuming they are not shared with another process that still holds a reference) handed over to a low priority background task which zeroes them out and adds them to the "zero pool".

score 2 · Answer 2 · answered Jul 16 '12 at 09:29

2

Yes, the heap is per-process as well.

There are heap implementations out there that use per-thread heaps in order to reduce lock contention, but that's an implementation detail of the (user-space) heap manager.

answered Jul 16 '12 at 09:29

janneb

36,249
2
81
97

score 0 · Answer 3 · answered Feb 18 '13 at 12:07

0

Read this, it's a bit old ('07) but has the relevant details outlined by Damon

http://www.blackhat.com/presentations/bh-usa-07/Ferguson/Whitepaper/bh-usa-07-ferguson-WP.pdf

answered Feb 18 '13 at 12:07

Pepe

405
5
12

heap management

3 Answers3

Linked