I was wondering what was the best manner to benchmark a GPU with CUDA over a specific application to obtain an accurate plot of GFLOPs/Watt.
So I would like to run a Kernel and get FLOP/Watt data to plot, and how I could do it efficiently.
Thanks
I was wondering what was the best manner to benchmark a GPU with CUDA over a specific application to obtain an accurate plot of GFLOPs/Watt.
So I would like to run a Kernel and get FLOP/Watt data to plot, and how I could do it efficiently.
Thanks