I have a simple DNN and I want to measure the GPU prediction time. I do not care about I/O events and data transfer and only care about the time model.predict() takes to complete on the GPU. I am using TensorFlow 2.5. I have tried using pythons time module, but I do not think that this is the correct way.
Is there a way I can measure that time?