I used the script from this post that extracts text from RTF files.
When I output the result into a text file I see a lot of "NEL" when viewing it in notepad++.
How can I replace "NEL"? For "NUL" I managed to remove it using replace(u'\x00', u' ')
I used the script from this post that extracts text from RTF files.
When I output the result into a text file I see a lot of "NEL" when viewing it in notepad++.
How can I replace "NEL"? For "NUL" I managed to remove it using replace(u'\x00', u' ')
NEL is Newline. It should be U+0085
.
If your editor is showing all of your text on one line with NEL
in the middle of text, you could replace(u'\x85', u'\n')
(or u'\r\n'
depending on your system).