1

I have a huge spark data frame with many columns (PySpark). [number of columns around 100 and number of rows more than 5000000]. I want to convert this data frame into Pandas data frame. However, by df.toPandas() is not efficient, since it takes lots of time.

Any help on this please?

Saeid SOHEILY KHAH
  • 747
  • 3
  • 10
  • 23

0 Answers0