PySpark: Converting Spark Dataframe to Pandas Dataframe [alternative for .toPandas()]

Asked Jun 14 '18 at 10:32

Active Jun 15 '18 at 11:40

Viewed 391 times

I have a huge spark data frame with many columns (PySpark). [number of columns around 100 and number of rows more than 5000000]. I want to convert this data frame into Pandas data frame. However, by df.toPandas() is not efficient, since it takes lots of time.

Any help on this please?

edited Jun 15 '18 at 11:40

asked Jun 14 '18 at 10:32

Saeid SOHEILY KHAH

PySpark: Converting Spark Dataframe to Pandas Dataframe [alternative for .toPandas()]

0 Answers0