I am trying to do some analysis of the MODIS satellite data. My code primarily reads a lot of files (806) of the dimension 1200 by 1200 (806*1200*1200). It do it using a for loop
and perform mathematical operations.
Following is the general way in which I read files.
mindex=np.zeros((1200,1200))
for i in range(1200):
var1 = xray.open_dataset('filename.nc')['variable'][:,i,:].data
for j in range(1200):
var2 = var1[:,j]
## Mathematical Calculations to find var3[i,j]##
mindex[i,j] = var3[i,j]
Since its a lot of data to handle, the process is very slow and I was considering parallelizing it. I tried doing something with joblib
, but I have not been able to do it.
I am unsure how to tackle this problem.