Suppress 'u' in Output - Python

Question

How can I get rid of these `u` prefixes in the output?

Regex:

Tregex1 = "1?\W*([2-9][0-8][0-9])\W*([2-9][0-9]{2})\W*([0-9]{4})(\se?x?t?(\d*))?"

Code:

for a in re.findall(Tregex1,text_value,re.IGNORECASE):
        print a

Output:

(u'877', u'638', u'7848', u'\n', u'')
(u'650', u'627', u'1000', u'\n', u'')
(u'650', u'627', u'1001', u'\nE', u'')
(u'312', u'273', u'4100', u'', u'')

I tried the following, after going through several similar questions:

a.encode('ascii', 'ignore')
a.encode('utf-8')
",".join(a)

But none of them work.

Expected Output:

877-638-7848
650-627-1000
650-627-1001
312-273-4100

I am using Python 2.7

Also, can someone explain why the fourth element is sometimes \n, sometimes \nE, and sometimes blank?

You do not have to worry about the `u` prefix, it only tells you the strings are Unicode. — Wiktor Stribiżew, Jun 20 '16 at 10:18
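To illustrate the comment above, here is a minimal sketch using one tuple copied from the question's output. It assumes nothing beyond that tuple, and is written so it runs on both Python 2.7 and Python 3. The point is that the prefix shows up only in the repr of a string, which is what you see when a whole tuple is printed.

```python
from __future__ import print_function

# One tuple copied from the question's output.
a = (u'877', u'638', u'7848', u'\n', u'')

# Printing a container shows the repr() of every element. On Python 2.7
# that repr carries the u prefix; on Python 3 the prefix is simply absent.
print(a)

# Printing a single element shows its text, with no prefix and no quotes.
first = a[0]
print(first)

# The prefix is not part of the data: the string is three characters long.
print(len(first))
```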

score 2 · Accepted Answer · edited May 23 '17 at 12:22

Try this:

for a in re.findall(Tregex1,text_value,re.IGNORECASE):
    print '-'.join(a[:3])

The u just tells you that it's a Unicode string.

The (..., ...) is the representation of the tuple. Printing a tuple shows the repr of each element, which is where the u comes from.

'-'.join(...) connects the strings in ... with a -.

a[:3] means "only the first three elements of a".

(For a good explanation of slice notation in Python, see https://stackoverflow.com/a/509295/327293)
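As for the second part of the question (why the fourth element is sometimes \n, sometimes \nE, and sometimes blank), it most likely comes from the optional tail of the pattern, (\se?x?t?(\d*))?. The \s matches the newline that follows a number, and with re.IGNORECASE the e? also swallows an E if the next line happens to start with one. When nothing follows the number, the whole group is skipped and findall reports it as an empty string. Here is a rough sketch that reproduces this; the text_value below is invented for the example, since the real input is not shown in the question.

```python
from __future__ import print_function
import re

# Pattern from the question, written as a raw string.
Tregex1 = r"1?\W*([2-9][0-8][0-9])\W*([2-9][0-9]{2})\W*([0-9]{4})(\se?x?t?(\d*))?"

# Hypothetical input, invented for this sketch; the real text_value is not shown.
text_value = (u"Call 877-638-7848\n"
              u"Office: 650-627-1000\n"
              u"Fax 650-627-1001\n"
              u"Email us. Main (312) 273-4100")

matches = re.findall(Tregex1, text_value, re.IGNORECASE)

for a in matches:
    # a[3] is whatever the optional tail consumed after the number;
    # a[:3] holds the three digit groups that make up the phone number.
    print(repr(a[3]), '-'.join(a[:3]))
```

The third number is followed by a line starting with "Email", so its fourth group comes out as \nE. The last number sits at the very end of the text, so its fourth group is empty.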

score 1 · Answer 2 · answered Jun 20 '16 at 10:20

Your problem is not the u. If you want to format your results in a specific way, you should use the string formatting functions.

print '-'.join(a)

Daniel Roseman

Dan · Answer 3 · 2016-06-20T10:23:23.990

The u just means the string is Unicode. You can re-encode it as you wish. This will work, and it also skips the blank values:

a = (u'877', u'638', u'7848', u'\n', u'')
print "-".join([x.strip() for x in a if x.strip() != u""])

877-638-7848
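One caveat, shown as a rough sketch: filtering out blanks keeps every non-blank group, so if the text actually contains an extension, the fourth and fifth groups end up in the joined string as well. Slicing with a[:3] avoids that. The input string below is made up for illustration; the pattern is the one from the question.

```python
from __future__ import print_function
import re

# Pattern from the question, written as a raw string.
Tregex1 = r"1?\W*([2-9][0-8][0-9])\W*([2-9][0-9]{2})\W*([0-9]{4})(\se?x?t?(\d*))?"

# Hypothetical input with an extension after the number.
text_value = u"650-627-1001 ext12"

a = re.findall(Tregex1, text_value, re.IGNORECASE)[0]

# Strip-and-filter: drops blank groups but keeps the extension groups.
filtered = u"-".join([x.strip() for x in a if x.strip() != u""])

# Slice: keeps only the three digit groups of the phone number.
sliced = u"-".join(a[:3])

print(filtered)
print(sliced)
```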

edited Jun 20 '16 at 10:23

answered Jun 20 '16 at 10:21

Dan

Don't use `is` for string comparisons. – Daniel Roseman Jun 20 '16 at 10:22

@Dan I think phogi's answer is more time efficient since it does not bother to check for 4th index – prashantitis Jun 20 '16 at 10:27
