can someone help me with how to loop through a CUDA kernel to average the corner pixels, border pixels and/or middle pixels of a Mat image?
I know that for the middle pixels I should do this:
(src[threadIdx.x][ threadIdx.y] +
src[threadIdx.x -1][ threadIdx.y -1]+
src[threadIdx.x -1][ threadIdx.y]+
src[threadIdx.x -1][ threadIdx.y + 1]+
src[threadIdx.x][ threadIdx.y - 1]+
src[threadIdx.x][ threadIdx.y + 1]+
src[threadIdx.x +1][ threadIdx.y - 1]+
src[threadIdx.x + 1][ threadIdx.y]+
src[threadIdx.x+ 1][ threadIdx.y +1])/9.0;
right? what about the corner or the border pixels? How can you take care of the boundaries?