CUDA: How to assert in kernel code?

Question

What is the equivalent technique of an assertion in CUDA kernel code?

There does not seem to be an assert for CUDA kernel code. I want a way to catch programmer mistakes easily in kernel code. A mechanism where I can set conditions that need to be true and the kernel should bail out when the condition is false with an error message.

For any one who comes across this via Google, as I did, **asserts in kernel code are now possible:** http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#assertion — Sam, Mar 27 '14 at 11:51
@Sam, why don't you post this as an answer? It's hard to notice the comment: the other answers distract. — Serge Rogatch, Sep 16 '16 at 08:40
@SergeRogatch [Here](https://stackoverflow.com/a/31526218/311567) it is as an answer, but even that is not the accepted one! Problem with old questions I suppose, that need new answers. — dashesy, Mar 12 '18 at 23:12

score 6 · Accepted Answer · edited Apr 17 '20 at 11:21

For devices of cc 2.x or above, assertion , void assert(int expression), could be used within a kernel such that threads with expression == 0 send a message to stderr once a host synchronization function is called.

For other cases or when assertion cannot be used (e.g. on MacOS), you won't be able to return an error message or error code to the host from a kernel.

Instead, I would set a error state and check it from the host. Use device global memory or (better) mapped host memory for storing an error state, passed as a parameter to each kernel call. Use if statements in the kernel, and of if the statements fail, set the error code and return. You will be able to check the error code from the host after the kernel call, but keep in mind that you will have synchronize the host and device after the kernel launch before checking the error code. I guess this will work fine for development but not so much for production.

As to printing an error message straight from the device

In 1.x, 2.x, and 3.0 cards, you can use emulation mode to print an error message.
In 3.1 forward (on fermi), apparently you can use printf in kernels to print the error message. It appears that it doesn't always work right away, e.g. http://forums.nvidia.com/index.php?showtopic=182448

score 4 · Answer 2 · edited May 23 '17 at 12:18

I would like to point out that an assert may occur in one thread only, but if you decide to early terminate that thread its absense may cause other bugs (and probably other asserts) happening later; possibly leading to a complete kernel crash and loose of all information on the GPU.

Also, the answer given at " Using assert within kernel invocation " will work only if the assert is used directly in the __ global__ function and not deeper, somewhere inside __ device__ function.

My suggestion is, that even an assert fails, you proceed normally with your code, but leave an error message. You can use mapped, pinned memory (you map host RAM memory into GPU address space) to store error codes/messages. That way, even if your kernel crashes and GPU is reset, you are likely to obtain valuable information in that mapped memory. If I am not mistaken, mapped, pinned memory is supported by almost all devices of Compute Capability 1.1 and higher.

score 3 · Answer 3 · edited May 23 '17 at 12:02

3

You may find this helpful:

Using assert within kernel invocation

Alternatively you can catch cudaError using cudaThreadSynchronize() which gives you one of about 40 different reasons for kernel returning an error. But mostly you can check those conditions using if/else commands in the kernel.

edited May 23 '17 at 12:02

Community

1
1

answered Feb 25 '11 at 07:43

jwdmsd

2,107
2
16
30

Jawad: Just returning from the kernel on assert failure is not very useful. Using printf() however is useful. Thanks! :-) – Ashwin Nanjappa Feb 25 '11 at 07:48
@Ashwin - well, you cannot printf from the kernel and there's no way around that. – jmilloy Feb 25 '11 at 12:11
Can you set cudaError to a custom value in a kernel? – jmilloy Feb 25 '11 at 12:13
2

Fermi allows printf within the kernel. Other than that you can make your custom enum and return as many different errors as you want. – jwdmsd Feb 25 '11 at 12:53

CUDA: How to assert in kernel code?

3 Answers3

Linked