Is it possible to partition a pandas DataFrame and process the parts in parallel with multiprocessing?
Specifically, my DataFrames are simply too big and take several minutes to run even one transformation on a single processor.
I know I could do this in Spark, but a lot of code has already been written, so I would prefer to stick with what I have and add parallelism to it.
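
To make the question concrete, here is a minimal sketch of the kind of thing I have in mind. It assumes the transformation works row by row, so each chunk can be processed independently. The names `transform`, `split_frame`, and `parallel_apply` and the columns `a`, `b`, `c` are hypothetical placeholders, not my real code. I have not verified that this is the right approach or that it speeds things up, since each chunk has to be pickled and sent to a worker process.

```python
import multiprocessing as mp

import numpy as np
import pandas as pd


def transform(chunk):
    # Hypothetical stand-in for an expensive, row-independent transformation.
    out = chunk.copy()
    out["c"] = out["a"] * 2 + out["b"]
    return out


def split_frame(df, n_chunks):
    # Cut the row positions into n_chunks contiguous blocks of similar size.
    bounds = np.linspace(0, len(df), n_chunks + 1, dtype=int)
    return [df.iloc[start:stop] for start, stop in zip(bounds[:-1], bounds[1:])]


def parallel_apply(df, func, n_workers=4):
    # Each chunk is pickled, sent to a worker process, and the result sent back.
    chunks = split_frame(df, n_workers)
    with mp.Pool(n_workers) as pool:
        parts = pool.map(func, chunks)
    # pool.map preserves input order, so concatenating restores the row order.
    return pd.concat(parts)


if __name__ == "__main__":
    df = pd.DataFrame({"a": range(10), "b": range(10, 20)})
    result = parallel_apply(df, transform, n_workers=2)
    # Compare against the single-process result.
    print(result.equals(transform(df)))
```

The `if __name__ == "__main__":` guard is needed on platforms where worker processes are started by re-importing the main module. The function passed to `pool.map` must be defined at module level so it can be pickled.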