I have an application that creates 2 threads in additional to the main.
The latter 2 threads will be moved to core_id_2 and core_id_3 (via pthread_setaffinity_np
).
When I ran taskset -c [core_id_1] perf mem record -c [core_id_2] [executable]
and then perf mem report
.
It shows that the application had zero memory load operations while having 14M store operations.
Did I run the command incorrectly? The application was reading a few GB file in a tight loop. Not sure why it recorded zero.
I ran htop in parallel to confirm that core_id_2 was indeed running the application (taking up 100% CPU).
Any suggestion?
Was running on a kernel 3.10.0-1062 machine.