I expected the GTX 680 (one of the latest GPUs) to support concurrent data transfer, i.e. simultaneous transfers in both directions (host to device and device to host). But when I run the CUDA SDK "Device Query" sample, the "Concurrent copy and execution" entry reports "Yes with 1 copy engine", which means the GPU cannot transfer data in both directions at the same time.
Do you get the same result on your cards? And can you share which devices are capable of concurrent data transfer in both directions?
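For anyone who wants to check their own card without running the full SDK sample, here is a minimal sketch that reads the same information directly from the runtime API. As far as I understand, the "copy engine" number that "Device Query" prints comes from the `asyncEngineCount` field of `cudaDeviceProp`; the wording of the printed messages below is my own, not the SDK's.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int deviceCount = 0;
    cudaError_t err = cudaGetDeviceCount(&deviceCount);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "cudaGetDeviceCount failed: %s\n",
                     cudaGetErrorString(err));
        return 1;
    }

    for (int dev = 0; dev < deviceCount; ++dev) {
        cudaDeviceProp prop;
        err = cudaGetDeviceProperties(&prop, dev);
        if (err != cudaSuccess) {
            std::fprintf(stderr, "cudaGetDeviceProperties(%d) failed: %s\n",
                         dev, cudaGetErrorString(err));
            return 1;
        }

        // asyncEngineCount is the number of copy engines:
        //   0  -> copies cannot overlap kernel execution
        //   1  -> a copy can overlap a kernel, one direction at a time
        //   2+ -> copies in both directions can run concurrently
        std::printf("Device %d (%s): %d copy engine(s)\n",
                    dev, prop.name, prop.asyncEngineCount);

        if (prop.asyncEngineCount >= 2) {
            std::printf("  concurrent transfer in both directions: yes\n");
        } else {
            std::printf("  concurrent transfer in both directions: no\n");
        }
    }
    return 0;
}
```

This should build with `nvcc` (for example `nvcc check_engines.cu -o check_engines`, where the file name is just a placeholder). If you run it, please post the device name and the engine count it reports.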
Thanks!