0

I recieve HTML-files and they contain Strings like that " ("), ü(ΓΌ) and so on.

I need them humand-readable. So I could use str.replace() for that. But isn't there a package/library for Python3 which knows all character-codes by itself and could handle that?

buhtz
  • 10,774
  • 18
  • 76
  • 149

2 Answers2

3

You can use html.unescape():

import html
print(html.unescape('"ü'))
u32i64
  • 2,384
  • 3
  • 22
  • 36
1

Se the solution here. It's called decode (or unescape) and yes there is a library for that.

dben
  • 484
  • 1
  • 6
  • 21