
I have a string that looks like this.

st = '/M\xe4rzen'

I would like to convert this to unicode. How can I do this? I've tried:

st.decode('utf-8')
unicode(st, 'utf-8')

The original file is utf-8 encoded, but I can't seem to get the unicode representation of the string.

Martijn Pieters
user1728853

1 Answer


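Your data is not UTF-8 encoded: in UTF-8, the byte \xe4 would have to start a three-byte sequence, but here it is followed by a plain ASCII r. More likely the data uses the Latin-1 encoding, where \xe4 is ä: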

>>> print st.decode('latin1')
/Märzen

Calling .decode() is enough; there is no need to also call unicode().
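If you are not sure which encoding a byte string uses, one option is to try UTF-8 first and fall back to Latin-1 when decoding fails. The following is only a minimal sketch: the helper name decode_with_fallback and its default list of encodings are made up for illustration, and the byte literal b'...' is used so the snippet also runs on Python 3, where the question's str would be bytes.

```python
def decode_with_fallback(raw, encodings=("utf-8", "latin-1")):
    """Decode raw bytes with the first candidate encoding that succeeds.

    Hypothetical helper, not part of the standard library.
    """
    for encoding in encodings:
        try:
            return raw.decode(encoding)
        except UnicodeDecodeError:
            # This encoding cannot represent the bytes; try the next one.
            continue
    raise ValueError("none of the candidate encodings could decode the data")


raw = b'/M\xe4rzen'                # same bytes as st in the question
text = decode_with_fallback(raw)   # UTF-8 fails on \xe4, Latin-1 succeeds
```

Keep in mind that Latin-1 maps every possible byte to a character, so it never raises an error. A successful Latin-1 decode therefore does not prove the data really was Latin-1; it is worth checking the result by eye.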

Martijn Pieters