I was wondering how to use __cos(x) (and, correspondingly, __sin(x)) in CUDA kernel code. I read in the CUDA manual that such a device function exists; however, when I call it, the compiler says that I cannot call a host function from device code.
However, I found two sister functions, cosf(x) and __cosf(x). The latter runs on the SFU (special function unit) and is much faster overall than the standard cosf(x). The compiler does not complain about __cosf(x), of course.
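For reference, here is a minimal sketch of what I am comparing. The kernel name compareCos, the buffer names, and the input values are made up for illustration, and error checking is left out for brevity. The commented-out line is the call that fails to compile for me.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel (name chosen for illustration): evaluates both
// single-precision cosine variants on the same inputs.
__global__ void compareCos(const float* in, float* accurate, float* fast, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        accurate[i] = cosf(in[i]);    // standard single-precision cosine
        fast[i]     = __cosf(in[i]);  // fast intrinsic, trades accuracy for speed
        // float bad = __cos(in[i]);  // the call that fails to compile for me
    }
}

int main()
{
    const int n = 4;
    const float hIn[n] = {0.0f, 0.5f, 1.0f, 2.0f};  // arbitrary sample inputs
    float hAccurate[n], hFast[n];

    float *dIn, *dAccurate, *dFast;
    cudaMalloc(&dIn, n * sizeof(float));
    cudaMalloc(&dAccurate, n * sizeof(float));
    cudaMalloc(&dFast, n * sizeof(float));

    cudaMemcpy(dIn, hIn, n * sizeof(float), cudaMemcpyHostToDevice);

    compareCos<<<1, n>>>(dIn, dAccurate, dFast, n);

    // These copies wait for the kernel to finish before reading the results.
    cudaMemcpy(hAccurate, dAccurate, n * sizeof(float), cudaMemcpyDeviceToHost);
    cudaMemcpy(hFast, dFast, n * sizeof(float), cudaMemcpyDeviceToHost);

    for (int i = 0; i < n; ++i) {
        printf("x = %f  cosf = %f  __cosf = %f\n", hIn[i], hAccurate[i], hFast[i]);
    }

    cudaFree(dIn);
    cudaFree(dAccurate);
    cudaFree(dFast);
    return 0;
}
```

I compile this with nvcc. With the __cos line commented out it builds and the two result columns agree closely for these small inputs.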
Is there a library I'm missing? Am I mistaken about this trig function?