Occasionally i have a string which says it's a unicode, but in fact it's not. It's like this:
s = u'\xe8\xaf\xb8\xe8\x91\x9b\xe4\xba\xae'
It's in fact just a bytestring with a 'u' in front of it. Don't know how to fix this. When i try to convert it to a real unicode with unicode(s, 'utf8'), code fails, because it's already been a unicode. Decoding with s.decode('utf8') fails with UnicodeEncodeError too.