The documentation available here says that the cache line containing the specified address is brought from memory into the cache, at the cache level given by the hint.
However, I am unsure whether the LLC is also looked up (assuming the hint specifies L1D or L2), or whether memory is always accessed regardless of whether the data is already present in the LLC.
The reason I'm asking is that, in some of my experiments, using the _mm_prefetch intrinsic has increased my LLC-loads count (a perf event), even though I'm getting an overall performance benefit.
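
To make the question concrete, here is a minimal sketch of the kind of usage I mean: a software prefetch with an L1D hint (_MM_HINT_T0) issued a fixed distance ahead of an indirect load. The function name, the indirect access pattern, and the prefetch distance are all illustrative assumptions, not the exact code from my experiments.

```c
#include <stddef.h>
#include <stdint.h>
#include <xmmintrin.h>  /* _mm_prefetch, _MM_HINT_T0 */

/* Hypothetical prefetch distance, in elements; it would need tuning per machine. */
#define PREFETCH_DISTANCE 64

/* Sums data[idx[i]] for i in [0, n), prefetching a future element with the T0 hint. */
uint64_t sum_indirect_with_prefetch(const uint32_t *data,
                                    const uint32_t *idx,
                                    size_t n)
{
    uint64_t sum = 0;
    for (size_t i = 0; i < n; i++) {
        /* Only prefetch while the look-ahead index is still in bounds. */
        if (i + PREFETCH_DISTANCE < n) {
            _mm_prefetch((const char *)&data[idx[i + PREFETCH_DISTANCE]],
                         _MM_HINT_T0);
        }
        sum += data[idx[i]];
    }
    return sum;
}
```

Comparing a build of such a loop with and without the _mm_prefetch call under something like perf stat -e LLC-loads is how the difference in the counter would show up.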