I have already tried pref-stat, papi, valgrind. these tools only extract the number of cache misses for a program.
Is it possible to monitor whole cache and record the time of cache miss or hit?
I have already tried pref-stat, papi, valgrind. these tools only extract the number of cache misses for a program.
Is it possible to monitor whole cache and record the time of cache miss or hit?
Did you see this:
https://perf.wiki.kernel.org/index.php/Tutorial#Live_analysis_with_perf_top
And you can print these on the web screen with simple tricks.
In VTune Amplifier Memory access analysis you can see DRAM bound (due to bandwidth or latency) metric which will give you approximation of cycles affected due to memory problems. You can also see average latency there of memory operations.