How to read whole line in pyspark with charset？

Asked Jun 04 '19 at 06:43

Active Jun 04 '19 at 06:43

Viewed 59 times

I'm using pyspark to read textfiles which are encoded by gbk. So how can i use pyspark to read this files by gbk.

read.csv() can't read whole line, hadoopfile() only support 'utf-8'

asked Jun 04 '19 at 06:43

cxco

You can go through this answer: https://stackoverflow.com/questions/904041/reading-a-utf8-csv-file-with-python – C_codio Jun 04 '19 at 07:08
use pyspark,not only python – cxco Jun 04 '19 at 10:46

0 Answers0