I am in some trouble in arranging the threads according to my 2D data array.
It is a compact array where every integer contains 32 bit values [1000110001000000010000000000010]
representing transactions and I need to count the bits row wise(I have used integer instead of bit vector/bitset). Array is of dimension 1000*3125
. Every row contains 1 lakh bit values.
I need to count total bits that are set to 1 for each row ie. for 3125 columns in each row. How should I arrange the threads/ loop for optimum performance?