From here https://people.freebsd.org/~lstewart/articles/cpumemory.pdf
--
- Does a write operation bring data into the cache?
From the article:
By default all data read or written by the CPU cores is stored in the
cache. There are memory regions which cannot be cached but this is
something only the OS implementers have to be concerned about; it is
not visible to the application programmer. There are also instructions
which allow the programmer to deliberately bypass certain caches. This
will be discussed in section 6.
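
For example, the cache-bypassing instructions the article refers to are the non-temporal store instructions. Below is a minimal sketch, assuming an x86-64 CPU with SSE2 and the <emmintrin.h> intrinsics; the function name and the 16-byte alignment requirement on dst are my assumptions for illustration:

/* Fill a buffer with non-temporal stores, which write to memory
 * without leaving the data in the cache. */
#include <emmintrin.h>
#include <stdint.h>
#include <stddef.h>

void fill_streaming(uint32_t *dst, size_t n, uint32_t value)
{
    /* dst is assumed to be 16-byte aligned (e.g. from aligned_alloc). */
    __m128i v = _mm_set1_epi32((int)value);
    size_t i;
    for (i = 0; i + 4 <= n; i += 4)
        _mm_stream_si128((__m128i *)(dst + i), v); /* bypasses the cache */
    for (; i < n; i++)
        dst[i] = value;     /* tail handled with ordinary (cached) stores */
    _mm_sfence();           /* make the streaming stores globally visible */
}

Large memset/memcpy implementations often use the same trick above a size threshold, so that a one-off bulk write does not evict the rest of the working set.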
--
- When a large allocation is made, the memory may come straight from the OS. Does the allocation itself occupy the cache?
Probably not: the memory will occupy the cache only once its data is actually read or written. From the article:
On operating systems like Linux with demand-paging support, an mmap
call only modifies the page tables ... No actual memory is allocated
at the time of the mmap call.
The allocation part happens when a memory page is first accessed,
either by reading or writing data, or by executing code. In response
to the ensuing page fault, the kernel takes control and determines,
using the page table tree, the data which has to be present on the
page. This resolution of the page fault is not cheap, but it happens
for every single page which is used by a process.
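
A minimal sketch of this on Linux, using the minor page-fault counter from getrusage(2); the mapping size and names are mine, for illustration only:

/* Show that mmap itself allocates nothing: the minor page-fault
 * counter only grows once the pages are actually touched. */
#define _DEFAULT_SOURCE
#include <stdio.h>
#include <string.h>
#include <sys/mman.h>
#include <sys/resource.h>

static long minor_faults(void)
{
    struct rusage ru;
    getrusage(RUSAGE_SELF, &ru);
    return ru.ru_minflt;
}

int main(void)
{
    size_t len = 64UL * 1024 * 1024;            /* 64 MiB */
    long before = minor_faults();

    char *p = mmap(NULL, len, PROT_READ | PROT_WRITE,
                   MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (p == MAP_FAILED)
        return 1;
    long after_mmap = minor_faults();

    memset(p, 1, len);                          /* first touch => page faults */
    long after_touch = minor_faults();

    printf("faults caused by mmap alone: %ld, after touching the pages: %ld\n",
           after_mmap - before, after_touch - before);
    munmap(p, len);
    return 0;
}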
--
- Assume there is an allocated array B, and all of B is now in the cache. Will the cache lines occupied by B become invalid (available) right after I free B?
From the article, a cache line is invalidated only when another CPU writes to it:
What developed over the years is the MESI cache coherency protocol
(Modified, Exclusive, Shared, Invalid). The protocol is named after
the four states a cache line can be in when using the MESI protocol.
... If the second processor wants to write to the cache line the first
processor sends the cache line content and marks the cache line
locally as Invalid.
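
The cost of this invalidation traffic can be seen with a false-sharing experiment. A minimal sketch, assuming Linux with pthreads, a 64-byte cache line, and compilation with -pthread; the struct layout and iteration count are arbitrary choices of mine:

/* Two threads each increment their own counter.  In the first run both
 * counters share one cache line, so every write on one core invalidates
 * the line in the other core's cache; in the second run the counters
 * live in separate cache lines. */
#define _POSIX_C_SOURCE 200809L
#include <pthread.h>
#include <stdio.h>
#include <time.h>

#define ITERS 100000000L

struct counters {
    _Alignas(64) volatile long a;
    volatile long b;                   /* same cache line as a       */
    char pad[64];                      /* padding for the second run */
    volatile long c;                   /* its own cache line         */
};

static struct counters ctr;

static void *bump_a(void *arg) { for (long i = 0; i < ITERS; i++) ctr.a++; return arg; }
static void *bump_b(void *arg) { for (long i = 0; i < ITERS; i++) ctr.b++; return arg; }
static void *bump_c(void *arg) { for (long i = 0; i < ITERS; i++) ctr.c++; return arg; }

static double run(void *(*f1)(void *), void *(*f2)(void *))
{
    struct timespec t0, t1;
    pthread_t x, y;
    clock_gettime(CLOCK_MONOTONIC, &t0);
    pthread_create(&x, NULL, f1, NULL);
    pthread_create(&y, NULL, f2, NULL);
    pthread_join(x, NULL);
    pthread_join(y, NULL);
    clock_gettime(CLOCK_MONOTONIC, &t1);
    return (t1.tv_sec - t0.tv_sec) + (t1.tv_nsec - t0.tv_nsec) / 1e9;
}

int main(void)
{
    printf("same cache line:      %.2fs\n", run(bump_a, bump_b));
    printf("separate cache lines: %.2fs\n", run(bump_a, bump_c));
    return 0;
}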
A cache line can also be evicted:
Another detail of the caches which is rather uninteresting to
programmers is the cache replacement strategy. Most caches evict the
Least Recently Used (LRU) element first.
And from my experience with TCMalloc, free() is not a compelling reason to evict the memory from the cache. On the contrary, evicting it could even hurt performance. On free(), TCMalloc simply puts the freed block of memory into its own cache, and that same block will be handed back by malloc() the next time the application asks for a block of that size. That is the essence of a caching allocator like TCMalloc, and if the block is still in the CPU cache, so much the better for performance!
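
A toy sketch of that caching-allocator idea (not TCMalloc's actual implementation, just the principle of free() stashing blocks for the next malloc(); a single fixed size class is assumed to keep it short):

/* free() pushes the block onto a free list, and the next allocation of
 * that size pops it back, so the returned memory may still be warm in
 * the CPU cache. */
#include <stdlib.h>

#define BLOCK_SIZE 256

struct free_block {
    struct free_block *next;
};

static struct free_block *free_list;   /* cached blocks of BLOCK_SIZE bytes */

void *toy_malloc(void)
{
    if (free_list) {                   /* reuse a cached, possibly cache-warm block */
        struct free_block *b = free_list;
        free_list = b->next;
        return b;
    }
    return malloc(BLOCK_SIZE);         /* otherwise fall back to the system allocator */
}

void toy_free(void *p)
{
    /* Do not return the memory to the OS; keep it for the next toy_malloc(). */
    struct free_block *b = p;
    b->next = free_list;
    free_list = b;
}

Real TCMalloc adds per-thread caches and many size classes on top of this, but the point stands: the block handed back by the next allocation may still have its cache lines hot.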