I want to allow users to define functions to be used in CUDA kernels (or be called by CUDA kernels).
I don't want to expose CUDA API to the users. The functions should look like typical c++/lua functions.
I've checked pyCUDA, but it seems to only be wrapper around .cu code.
I'd rather have a .lua or .cc file and use function pointers. Is it remotely possible?