I'm trying to design a benchmark for a low-latency workload in which each operation has a median latency in the hundreds of nanoseconds. I'm curious how much fidelity I can expect from a benchmark that takes its measurements with rdtsc(), i.e. by reading the CPU's timestamp counter (TSC) before and after each operation.
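For concreteness, the kind of measurement loop I have in mind looks roughly like the sketch below. It assumes x86-64 and GCC or Clang (for `<x86intrin.h>`). The names `tsc_begin`, `tsc_end`, `median_ticks`, and `dummy_op` are placeholders made up for illustration, and the fence placement follows a commonly suggested pattern rather than anything I have verified on my hardware.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <x86intrin.h>  /* __rdtsc, __rdtscp, _mm_lfence (x86 only, GCC/Clang) */

/* Read the TSC at the start of a timed region. The fences are meant to keep
   earlier instructions from overlapping with the region being timed. */
static inline uint64_t tsc_begin(void) {
    _mm_lfence();
    uint64_t t = __rdtsc();
    _mm_lfence();
    return t;
}

/* Read the TSC at the end of a timed region. RDTSCP waits for prior
   instructions to execute; the trailing fence keeps later ones out. */
static inline uint64_t tsc_end(void) {
    unsigned aux;  /* receives IA32_TSC_AUX (typically a CPU id); unused here */
    uint64_t t = __rdtscp(&aux);
    _mm_lfence();
    return t;
}

/* Placeholder workload standing in for the real operation (hypothetical). */
static void dummy_op(void) {
    static volatile uint64_t sink;
    for (int i = 0; i < 64; i++) sink += (uint64_t)i;
}

/* Comparator for qsort over uint64_t values. */
static int cmp_u64(const void *a, const void *b) {
    uint64_t x = *(const uint64_t *)a, y = *(const uint64_t *)b;
    return (x > y) - (x < y);
}

/* Time op() n times, store each delta in samples, sort the deltas,
   and return the median in TSC ticks (not nanoseconds). */
static uint64_t median_ticks(void (*op)(void), uint64_t *samples, size_t n) {
    assert(n > 0);
    for (size_t i = 0; i < n; i++) {
        uint64_t t0 = tsc_begin();
        op();
        uint64_t t1 = tsc_end();
        samples[i] = t1 - t0;
    }
    qsort(samples, n, sizeof samples[0], cmp_u64);
    return samples[n / 2];
}
```

I would call it as `median_ticks(dummy_op, buf, 1001)` with `buf` holding 1001 `uint64_t` slots, swapping `dummy_op` for the real operation. The result is in TSC ticks, so converting it to nanoseconds would additionally require knowing the TSC frequency.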
What are the drawbacks or potential problems with this approach?