0

I used the script from this post that extracts text from RTF files.

When I output the result into a text file I see a lot of "NEL" when viewing it in notepad++.

How can I replace "NEL"? For "NUL" I managed to remove it using replace(u'\x00', u' ')

mtryingtocode
  • 939
  • 3
  • 13
  • 26
  • What is NEL? Is it just three capital letters? – DYZ Aug 16 '17 at 04:16
  • It's some sort of special character or symbol. Similar on how linefeeds and carriage returns are displayed in notepad++ – mtryingtocode Aug 16 '17 at 04:33
  • What's its Unicode code? How does it look like in notepad? Do you have any references to it? I have never heard of it, and Google doesn't know about it, either. – DYZ Aug 16 '17 at 04:35

1 Answers1

2

NEL is Newline. It should be U+0085.

If your editor is showing all of your text on one line with NEL in the middle of text, you could replace(u'\x85', u'\n') (or u'\r\n' depending on your system).

robert_x44
  • 9,224
  • 1
  • 32
  • 37