I have folder of json files (approx 70 GB data), these json files are emails. I want to open all the files and find Unicodes using python. later I want to replace those Unicodes with any regular expression. Could you please provide a layout that I can follow through?
I am doing this to get rid of the error:
ValueError: Unpaired high surrogate when decoding 'string' on reading json file
I understand that this question might sound bit vague but please free to ask any doubts regarding the question.
Any help would be much appreciated :)