1

I'm getting a pandas DataFrame from a Dask DataFrame using:

p_df_data = d_df_data.compute()

But this is really slow... Is there an alternative method?

saeedar
  • This really depends on what transformations you are applying. Related: [why is multiprocessing slower than a simple computation in Pandas?](https://stackoverflow.com/questions/49837539/why-is-multiprocessing-slower-than-a-simple-computation-in-pandas) – jpp Sep 28 '18 at 12:22

1 Answer

2

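Dask DataFrames are lazy: every operation is essentially free until you call compute(), at which point they all run. The time you see in compute() is therefore the cost of the whole pipeline that built the DataFrame, not just the conversion to pandas.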

MRocklin