Print characters for integer in any arbitrary encoding

Question

How can one perform the equivalent of the chr() function for any arbitrary encoding? Consider this attempt (doesn't work):

for i in range(128, 255):
    print("%s " % (i.encode('cp1252'),) )

This fails with AttributeError: 'int' object has no attribute 'encode'.

Neither does this attempt work:

for i in range(128, 255):
    j = "\x%x " % (i,)
    print("%x " % (j.encode('cp1252'),) )

This fails with SyntaxError: (unicode error) 'unicodeescape' codec can't decode bytes in position 0-2: truncated \xXX escape.

I am specifically targeting Python 3.

score 2 · Accepted Answer · answered Dec 15 '13 at 11:42

2

What exactly are you trying to achieve? The second attempt comes down to this:

for i in range(128, 255):
    j = chr(i)
    print("%x " % (j.encode('cp1252'),) )

But I don't see how that's the equivalent of chr for cp1252. chr(i) converts from the position i in the code page to the corresponding character (the return value is a unicode string of length 1). You can do a similar conversion for cp1252 (or any other encoding), using bytes([i]).decode('cp1252'), which looks more like your first attempt.

answered Dec 15 '13 at 11:42

raymonad

939
1
5
9

I had to decode `j` and print it as a string, but this worked. `print("%s " % (j.encode('cp1252').decode('cp1252'), ))`. Note also that code points 128-159 would not work, I had to start at 160. Thank you. – dotancohen Feb 09 '14 at 10:56

score 1 · Answer 2 · edited May 23 '17 at 10:32

1

In Python 3, I believe you want this,

chr(i).encode('cp1252')

or in Python 2.x, you would use,

unichr(i).encode('cp1252')

Full code is,

for i in range(128, 255):
    print("%s " % (unichr(i).encode('cp1252',errors='replace')))

On my console, the cp1252 encoding does not work and throws an exception. To at least print a hex value, you can include the errors part.

This post is relevant: Python 3: Demystifying encode and decode methods and How to print non-ascii characters to file in Python 2.7

edited May 23 '17 at 10:32

Community

1
1

answered Dec 15 '13 at 11:37

William Denman

3,046
32
34

And not with 3.3 either. – aIKid Dec 15 '13 at 11:40
Of course I tried it. When you say 'not working' can you be a little bit more specific? – William Denman Dec 15 '13 at 11:43
Looping through `range(128, 255)` throws a `UnicodeDecodeError` in the first character – aIKid Dec 15 '13 at 11:59

Print characters for integer in any arbitrary encoding

2 Answers2