I am new to CUDA and need help understanding a few things. From the device query output, I understood two things:
- 6144 is the instantaneous capacity of my GPU, so when I run parallel code, at most 6144 threads can be running at the same time. Is that correct?
- I can launch a kernel of any size (in CUDA C), with any block size and any number of threads.
The device query is shown here. I would appreciate a clear explanation of these two points.
I am still confused about how many threads and blocks my GPU has, and how many threads I can use for a single task (the maximum and minimum limits).
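
To make the question concrete, here is a minimal sketch of how these limits could be read at runtime with `cudaGetDeviceProperties`. It assumes device 0, and it assumes that the 6144 figure corresponds to the number of multiprocessors times the maximum resident threads per multiprocessor, which I have not confirmed. The helper `maxResidentThreads` is a hypothetical name of my own, not part of the CUDA API.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper (not a CUDA API): number of threads that can be
// resident on the whole GPU at one instant, assuming every SM is full.
static int maxResidentThreads(int smCount, int threadsPerSm) {
    return smCount * threadsPerSm;
}

int main() {
    cudaDeviceProp prop;
    // Assumption: the GPU of interest is device 0.
    cudaError_t err = cudaGetDeviceProperties(&prop, 0);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "cudaGetDeviceProperties failed: %s\n",
                     cudaGetErrorString(err));
        return 1;
    }

    std::printf("Device: %s\n", prop.name);
    std::printf("Multiprocessors (SMs): %d\n", prop.multiProcessorCount);
    std::printf("Max resident threads per SM: %d\n",
                prop.maxThreadsPerMultiProcessor);
    std::printf("Max resident threads on GPU: %d\n",
                maxResidentThreads(prop.multiProcessorCount,
                                   prop.maxThreadsPerMultiProcessor));

    // Per-launch limits: these bound what a single kernel launch may request.
    std::printf("Max threads per block: %d\n", prop.maxThreadsPerBlock);
    std::printf("Max block dimensions: %d x %d x %d\n",
                prop.maxThreadsDim[0], prop.maxThreadsDim[1],
                prop.maxThreadsDim[2]);
    std::printf("Max grid dimensions: %d x %d x %d\n",
                prop.maxGridSize[0], prop.maxGridSize[1],
                prop.maxGridSize[2]);
    return 0;
}
```

If my reading is right, the "resident threads" line is the instantaneous capacity from my first point, while the per-block and per-grid limits are what bound the kernel size from my second point.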