I am trying to read Non-Ascii characters from a csv in pysaprk, specifically the csv contains names of countries in spanish, so I have ESPAÑA
(SPAIN in spanish), but it reads ESPA�OLA
this is the code I am using:
df = sqlContext.read.csv("path", sep=",", header=True )
I can't find all the encodings that sqlContext.read
accepts, i was trying to use latin-1
but I get a message that it is not supported