I am having little trouble in double parallelism in CUDA. i have a file named First.cu and a main file named main.cpp
I was getting error.
error : calling a __global__ function("kernel_6") from a __global__ function("kernel_5") is only allowed on the compute_35 architecture or above>
then I followed this thread and configured my properties like this again.
View -> Property Pages
Configuration Properties -> CUDA C/C++ -> Common -> Generate Relocatable Device Code -> Yes (-rdc=true
)Configuration Properties -> CUDA C/C++ -> Device -> Code Generation -> compute_35,sm_35
Configuration Properties -> Linker -> Input -> Additional Dependencies -> cudadevrt.lib
now I am getting error like this.
nvcc : fatal error : nvcc supports '--relocatable-device-code=true (-rdc=true)', '--device-c (-dc)', and '--device-link (-dlink)' only when targeting sm_20 or higher
Please help me. What should I do to remove this error?
I have added all the required CUDA Libraries. I installed CUDA 5.5 and I have GTX780 .