I am trying to download a webpage using urlib2 in python.
response = urlopen(url, timeout=10)
html = response.read()
html[170:198]
print html[170:198]
But the 'á' character present in it is encoded as '\u0e41', which is noThai Character Sara Ae, so far as I understand.
Cadeia Acion\u0e41ria da Empresa
Cadeia Acionแria da Empresa
The output of the print command should be:
Cadeia Acionária da Empresa
Can someone tell me what am I doing wrong?