I am having trouble reading the following CSV data, which is encoded in UTF-16:
FullName, FullLabel, Type
TEST.slice, "Consideration":"Verde (Spar Verde, Fonte Verde)", Test,
As far as I understand, the commas inside the quoted text should not be a problem for the reader, since there is a quote
parameter to handle that.
df = spark.read.csv(file_path, header=True, encoding='UTF-16', quote='"')
However, this still gives me an incorrect split.
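For illustration, the same kind of split can be reproduced outside Spark. This is only a minimal sketch using Python's standard csv module, which is an assumption on my side: Spark's CSV parser may behave differently in detail. The sample row is pasted in as a plain string rather than read from the UTF-16 file.

```python
import csv
import io

# Sample row copied from the data above (a plain str here, not the UTF-16 file).
line = 'TEST.slice, "Consideration":"Verde (Spar Verde, Fonte Verde)", Test,'

# Standard-library parser with the same quote character; this is NOT Spark.
fields = next(csv.reader(io.StringIO(line), quotechar='"'))

# Print each parsed field so the split points are visible.
for index, field in enumerate(fields):
    print(index, repr(field))

# The header declares three columns, so anything else means a wrong split.
print("field count:", len(fields))
```

With this parser the row comes out as five fields instead of three, because the second value is not a single field enclosed in one pair of quotes, and the comma after "Spar Verde" is therefore treated as a separator.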
Is there a way to handle such cases, or do I need to work around it with an RDD?
Thank you in advance.