I have found this post but am still unclear on what the redzone_checker
kernel is doing and why. Specifically, should it be taking > 90% of my application's runtime? TensorBoard reports that it is taking the vast majority of the runtime of my JAX code, and I'd like to know
- Is it actually the case that this kernel is taking way too much time, or is this a side effect of profiling JAX with TensorBoard (i.e., the output is misleading in some way)?
- Is there a way to reduce the amount of time taken by the
redzone_checker
kernel? Is that even a good idea?
Thanks in advance for any insights.