I'm using pyspark to read textfiles which are encoded by gbk. So how can i use pyspark to read this files by gbk.
read.csv() can't read whole line, hadoopfile() only support 'utf-8'
I'm using pyspark to read textfiles which are encoded by gbk. So how can i use pyspark to read this files by gbk.
read.csv() can't read whole line, hadoopfile() only support 'utf-8'