My array has 13107200 elements. I am trying to run a kernel with 25600 blocks of 512 threads each (25600 × 512 = 13107200 threads in total). The system info reports Max_Grid_Dim_X = 2147483647 and Max_Block_Dim_X = 1024, so the launch configuration should be within the limits. However, the NVIDIA driver keeps crashing while the program runs. How can I fix this?
    __global__ void aesEncrypt(node *data, int rounds)
    {
        int index = blockDim.x * blockIdx.x + threadIdx.x;
        addRoundKey(&data[index], 0);
        for (int i = 1; i <= rounds; i++) {
            subBytes(&data[index]);
            shiftRows(&data[index]);
            if (i != rounds)
                mixColumns(&data[index]);
            addRoundKey(&data[index], i);
        }
    }

    aesEncrypt<<<dim3(25600, 1, 1), dim3(512, 1, 1)>>>(gData, r);
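
To narrow the problem down, here is a minimal sketch of how a launch with the same dimensions might be checked for errors. It is not the AES code above: the `touch` kernel, the `unsigned char` buffer, and the extra `n` parameter are hypothetical stand-ins, chosen only to show a bounds guard on the index plus the `cudaGetLastError` and `cudaDeviceSynchronize` checks after the launch.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical stand-in for the per-element work (NOT the aesEncrypt kernel).
__global__ void touch(unsigned char *data, size_t n)
{
    size_t index = (size_t)blockDim.x * blockIdx.x + threadIdx.x;
    if (index < n)              // skip threads that fall past the end of the array
        data[index] ^= 0xFF;
}

int main()
{
    const size_t n = 13107200;  // array size from the question
    const int threads = 512;    // threads per block from the question
    const int blocks = (int)((n + threads - 1) / threads);  // rounds up; 25600 here

    unsigned char *d = nullptr;
    cudaError_t err = cudaMalloc(&d, n);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaMalloc failed: %s\n", cudaGetErrorString(err));
        return 1;
    }
    cudaMemset(d, 0, n);

    touch<<<blocks, threads>>>(d, n);

    // Errors in the launch itself (for example an invalid configuration).
    err = cudaGetLastError();
    if (err != cudaSuccess) {
        fprintf(stderr, "launch failed: %s\n", cudaGetErrorString(err));
        cudaFree(d);
        return 1;
    }

    // Errors raised while the kernel was running are reported on synchronization.
    err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        fprintf(stderr, "kernel failed: %s\n", cudaGetErrorString(err));
        cudaFree(d);
        return 1;
    }

    printf("kernel finished without errors (%d blocks x %d threads)\n", blocks, threads);
    cudaFree(d);
    return 0;
}
```

Applying the same pattern to `aesEncrypt` would mean passing the element count as an additional argument and checking both return codes after the launch. The reported error string should at least show whether the failure happens at launch time or during execution.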