I have some code and would like to optimize the L1 cache miss/hit ratio. Is a way to see the cache hit/miss in memory profiling in Python ?
There are tools in C++ like this: Measuring Cache Latencies
EDIT : It may include compiled variant of Python like Cython / Numba (JIT)