In continuation of my previous question. Is CUDA suitable for real-time quick applications? The task is: I need my application to make a lot of calculations in 0.1-0.3 ms. CUDA kernels cope with these calculations in a very good time suitable for my project, but with all the overheads I get (memory copy) the time is not acceptable.
Is CUDA just not usable for this kind of applications or there are some hacks to avoid sutuations described in my previous question?
These guys provide so called "GPU Workbench" with the modified gpu driver built on their own linux verson. They say that their system performs much faster then typical GPU configuraions. Anyone knows about them?