UnicodeEncodeError: 'ascii' codec can't encode characters in position 23-24: ordinal not in range(128)

Question

I am trying to parse a CSV file (test.csv) that contains the text below:

"legalgroup_text"   "Aktiebolag"    "Aktiebolag"    "LGAKTIEBOLAG"
"legalgroup_text"   "Allmän försäkringskassa"   "Allmän försäkringskassa"   "LGALLMAENFOERSAEKRINGSKASSA"

I am using the iso-8859-1 encoding, since the file contains Swedish characters:

import codecs
import csv

with codecs.open('test.csv', encoding='iso-8859-1') as label_file:
    data = csv.reader(label_file, delimiter='\t')
    for row in data:
        print(row)

I am getting this error:

Traceback (most recent call last):
  File "/mnt/ashraful/PycharmProjects/Test/test.py", line 6, in <module>
    for row in data:
UnicodeEncodeError: 'ascii' codec can't encode characters in position 23-24: ordinal not in range(128)

I also tried the utf-8 encoding, but got this error:

UnicodeEncodeError: 'ascii' codec can't encode character u'\xe4' in position 25: ordinal not in range(128)

The 2nd line, specifically the `ä` character. – Ashraful Islam Apr 19 '18 at 05:46
This line: `for row in data:` – Ashraful Islam Apr 19 '18 at 05:47
You really should migrate to Python 3 if you can. – Burhan Khalid Apr 19 '18 at 05:53

score 0 · Accepted Answer · answered Apr 19 '18 at 05:53

Give up on Python 2 and use Python 3. That by itself (no other changes, I just tested) will fix the issue.
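
To illustrate the suggestion above, here is a minimal Python 3 sketch of the same read. It assumes the file is tab-delimited and encoded as iso-8859-1, as in the question; `read_rows` is a hypothetical helper name, not something from the original code. In Python 3 the built-in `open` accepts an `encoding` argument and the `csv` module works on `str`, so the rows are never pushed through the ASCII codec.

```python
import csv


def read_rows(path, encoding="iso-8859-1"):
    """Return every row of a tab-delimited file as a list of str values.

    Hypothetical helper for illustration; the encoding default is an
    assumption taken from the question.
    """
    # newline="" lets the csv module handle line endings itself,
    # which is what the csv documentation recommends.
    with open(path, encoding=encoding, newline="") as label_file:
        return list(csv.reader(label_file, delimiter="\t"))


# Usage (assumes test.csv exists in the working directory):
#     for row in read_rows("test.csv"):
#         print(row)
```

Note that `print` can still raise a UnicodeEncodeError in Python 3 if the terminal or stdout encoding cannot represent the characters, so this sketch only covers the reading side.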

answered by tel

Why people cling to python 2 and complain about unicode errors is a mystery! – tdelaney Apr 19 '18 at 05:58
@tdelaney I have sympathy. Some people do have legit reasons for still using py2. Unicode support ain't one of them, though – tel Apr 19 '18 at 06:01
I'm getting this error on python 3 – Thirupathi Thangavel Jun 06 '18 at 10:02
This does not always fix encoding errors, plus moving to Python 3 isn't always possible or realistic for some environments yet. – zachhilbert Apr 08 '19 at 17:40
