I am currently writing to an h5py dataset, which I created as follows:
import h5py
import numpy as np
f = h5py.File(batch_path, 'w')
data = f.create_dataset('data_set', (525600, 1300), dtype=np.float32)
I then fill it one column at a time, one column per input file:
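import pandas as pd

for index, file in enumerate(files):
    df = pd.read_csv(file)
    # Each row is one ';'-separated string; keep the second field as a float.
    result = np.array([float(x.split(';')[1]) for x in df.to_numpy()[:, 0]])
    # Write the parsed values into column `index` of the dataset.
    data[:,index] = result[:]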
However, the last step (data[:,index] = result[:]) takes an extremely long time. What is wrong here?
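
To make this easier to reproduce, here is a minimal sketch of the same write pattern using synthetic data instead of my CSV files. The function name time_column_writes, the file name repro.h5, the reduced shape (20000, 50), and the random values are assumptions for illustration only, not my real setup; the sketch just times each column assignment.

```python
import os
import tempfile
import time

import h5py
import numpy as np

# Hypothetical, scaled-down stand-in for the real (525600, 1300) dataset.
N_ROWS, N_COLS = 20000, 50


def time_column_writes(path, n_rows=N_ROWS, n_cols=N_COLS):
    """Write one column at a time and return the seconds spent on each write."""
    timings = []
    rng = np.random.default_rng(0)
    with h5py.File(path, 'w') as f:
        data = f.create_dataset('data_set', (n_rows, n_cols), dtype=np.float32)
        for index in range(n_cols):
            # Synthetic replacement for the column parsed from a CSV file.
            result = rng.random(n_rows)
            start = time.perf_counter()
            data[:, index] = result[:]  # same assignment as in the question
            timings.append(time.perf_counter() - start)
    return timings


if __name__ == '__main__':
    with tempfile.TemporaryDirectory() as tmp:
        timings = time_column_writes(os.path.join(tmp, 'repro.h5'))
    print(f'mean seconds per column write: {np.mean(timings):.6f}')
```

Raising N_ROWS and N_COLS toward the real shape should show how the time per column assignment scales.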