EDITED: I have a code which looks like:
__author__ = 'feynman'
cimport cython
@cython.boundscheck(False)
@cython.wraparound(False)
@cython.nonecheck(False)
def MC_Surface(volume, mc_vol):
Perm_area = {
"00000000": 0.000000,
"11111111": 0.000000,
...
...
"11100010": 1.515500,
"00011101": 1.515500
}
cdef int j, i, k
for k in range(volume.shape[2] - 1):
for j in range(volume.shape[1] - 1):
for i in range(volume.shape[0] - 1):
pattern = '%i%i%i%i%i%i%i%i' % (
volume[i, j, k],
volume[i, j + 1, k],
volume[i + 1, j, k],
volume[i + 1, j + 1, k],
volume[i, j, k + 1],
volume[i, j + 1, k + 1],
volume[i + 1, j, k + 1],
volume[i + 1, j + 1, k + 1])
mc_vol[i, j, k] = Perm_area[pattern]
return mc_vol
In the hope to speed up, it was modified to:
{
...
...
"11100010": 1.515500,
"00011101": 1.515500
}
keys = np.array(Perm_area.keys())
values = np.array(Perm_area.values())
starttime = time.time()
tmp_vol = GetPattern(volume)
print 'time to populate the key array: ', time.time() - starttime
cdef int i
starttime=time.time()
for i, this_key in enumerate(keys):
mc_vol[tmp_vol == this_key] = values[i]
print 'time for the loop: ', time.time() -starttime
return mc_vol
def GetPattern(volume):
a = (volume.astype(np.int)).astype(np.str)
output = a.copy() # Central voxel
output[:, :-1, :] = np.char.add(output[:, :-1, :], a[:, 1:, :]) # East
output[:-1, :, :] = np.char.add(output[:-1, :, :], a[1:, :, :]) # South
output[:-1, :-1, :] = np.char.add(output[:-1, :-1, :], a[1:, 1:, :]) # SouthEast
output[:, :, :-1] = np.char.add(output[:, :, :-1], a[:, :, 1:]) # Down
output[:, :-1, :-1] = np.char.add(output[:, :-1, :-1], a[:, 1:, 1:]) # DownEast
output[:-1, :, :-1] = np.char.add(output[:-1, :, :-1], a[1:, :, 1:]) # DownSouth
output[:-1, :-1, :-1] = np.char.add(output[:-1, :-1, :-1], a[1:, 1:, 1:]) # DownSouthEast
output = output[:-1, :-1, :-1]
del a
return output
This takes longer for a 3D array of size say 500^3. Here, tmp_vol 3D array of strings. For example: if say tmp_vol[0,0,0] = "00000000" then mc_vol[0,0,0] = 0.00000. Alternatively, I can get rid of mc_vol and write if tmp_vol[0,0,0] = "00000000" then tmp_vol[0,0,0] = 0.00000.
Here, The for-loop takes a lot of time, I see that only one CPU is used. I tried to map them in parallel using map and lambda but ran into errors. I am really new to python so any hints will be great.