4

On devices with compute capability <= 7.2, I always use

nvprof --events shared_st_bank_conflict

but when I run it on an RTX 2080 Ti with CUDA 10, it returns

Warning: Skipping profiling on device 0 since profiling is not supported on devices with compute capability greater than 7.2

So how can I detect whether there are shared memory bank conflicts on this device?

I've installed NVIDIA Nsight Systems and Nsight Compute, but I can't find any such profiling report.

Thanks.

einpoklum
LiMou

2 Answers

2

You can use the --metrics option of the Nsight Compute CLI:

Either

nv-nsight-cu-cli --metrics l1tex__data_bank_conflicts_pipe_lsu_mem_shared_op_ld.sum

for conflicts when reading (load'ing) from shared memory, or

nv-nsight-cu-cli --metrics l1tex__data_bank_conflicts_pipe_lsu_mem_shared_op_st.sum

for conflicts when writing (store'ing) to shared memory.
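To check that these metrics report what you expect, it can help to profile a tiny kernel whose conflict behaviour is known in advance. The following is only a minimal sketch, not taken from the original answer: the kernel name `strided_copy`, the constants, and the file name are hypothetical, and it assumes the usual layout of 32 shared memory banks that are 4 bytes wide. The same kernel is launched twice, once with stride 1 (expected to be conflict-free) and once with stride 32 (every thread of the warp should hit the same bank).

```cuda
#include <cstdio>
#include <cuda_runtime.h>

constexpr int kThreads = 32;  // a single warp
constexpr int kStride  = 32;  // assumption: 32 banks, each 4 bytes wide

// Each thread stores to tile[threadIdx.x * stride], then loads the slot
// written by its neighbour. Reading another thread's slot keeps the
// compiler from eliding the shared memory traffic.
__global__ void strided_copy(float* out, int stride) {
    __shared__ float tile[kThreads * kStride];
    const int t = threadIdx.x;
    tile[t * stride] = static_cast<float>(t);       // shared store
    __syncthreads();
    out[t] = tile[((t + 1) % kThreads) * stride];   // shared load
}

int main() {
    float* d_out = nullptr;
    if (cudaMalloc(&d_out, kThreads * sizeof(float)) != cudaSuccess) return 1;

    float h_out[kThreads];
    const int strides[] = {1, kStride};  // conflict-free case, then worst case
    for (int stride : strides) {
        strided_copy<<<1, kThreads>>>(d_out, stride);
        // cudaMemcpy synchronizes and also surfaces launch/execution errors.
        if (cudaMemcpy(h_out, d_out, sizeof(h_out),
                       cudaMemcpyDeviceToHost) != cudaSuccess) return 1;
        for (int t = 0; t < kThreads; ++t) {
            if (h_out[t] != static_cast<float>((t + 1) % kThreads)) {
                std::printf("stride %d: mismatch at thread %d\n", stride, t);
                return 1;
            }
        }
        std::printf("stride %d: results OK\n", stride);
    }
    cudaFree(d_out);
    return 0;
}
```

Assuming the file is saved as bank_demo.cu and built with nvcc -o bank_demo bank_demo.cu, running ./bank_demo under nv-nsight-cu-cli with the two metrics above should report one result per kernel launch. Under the stated bank layout, the stride-1 launch should show no conflicts (or close to none), and the stride-32 launch should show clearly non-zero counts for both the load and the store metric.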

einpoklum
Deepak C U
1

This appears to be a known limitation, and it is addressed in this post on the NVIDIA forums. Apparently, it should be supported by one of the Nsight tools (either the CLI or the UI).