Is there a way to do a parallel forward fill on multiple columns in a Pandas DataFrame or a Numpy ndarray?

Asked Nov 11 '19 at 20:22

Active Nov 11 '19 at 20:22

Viewed 250 times

I have a pandas DataFrame and would like to find a way to speed up ffill and bfill operations on multiple columns. What methods exist to do this kind of operation on multiple columns in parallel?

One alternative would be using numpy's structured arrays and then JIT'ing the code, operating on each column using numba.prange. This requires writing efficient ffill and bfill operations in numpy.
Is there another way to make this operation parallel using possibly dask or some other parallelization technique?

asked Nov 11 '19 at 20:22

DonQuixote

Related - https://stackoverflow.com/questions/41190852 – Divakar Nov 11 '19 at 20:33
Have you done any benchmarks? How do you know that `ffill()` and `bfill` are problematic and would benefit from parallelization? – AMC Nov 11 '19 at 21:32

Is there a way to do a parallel forward fill on multiple columns in a Pandas DataFrame or a Numpy ndarray?

0 Answers0