Over ten years ago, there was a great question about the pros and cons of the CUDA Driver API vs Runtime API.
A lot of time has obviously passed, and I was wondering how much things have changed. The old question had a great answer about things that were problematic with the runtime API if you used multiple threads that interfaced with the API.
Is that still problematic? What do most folks writing code for GPUs use now? I'm just starting with CUDA and am wondering if I should prefer one API over the other. I am using multiple threads and contexts, if that makes a difference.