Convert Numpy array of ASCII codes to string

Question

I would like to convert a NumPy array of integers representing ASCII codes to the corresponding string. For example ASCII code 97 is equal to character "a". I tried:

from numpy import *
a=array([97, 98, 99])
c = a.astype('string')
print c

which gives:

['9' '9' '9']

but I would like to get the string "abc".

score 11 · Answer 1 · answered Sep 29 '15 at 23:57

Another solution that does not involve leaving the NumPy world is to view the data as strings:

arr = np.array([97, 98, 99], dtype=np.uint8).view('S3').squeeze()

or if your numpy array is not 8-bit integers:

arr = np.array([97, 98, 99]).astype(np.uint8).view('S3').squeeze()

In these cases however you do have to append the right length to the data type (e.g. 'S3' for 3 character strings).

Ashoka Lella · Accepted Answer · 2014-07-19T08:43:31.700

10

print "".join([chr(item) for item in a])

output

abc

edited Jul 19 '14 at 08:43

answered Jul 19 '14 at 08:37

Ashoka Lella

6,631
1
30
39

Thanks Ashoka for the nice solution. I was too focused on trying to use a NumPy function, but this seems like an elegant solution. – Håkon Hægland Jul 19 '14 at 08:45

jtaylor · Answer 3 · 2014-07-19T09:46:14.417

7

create an array of bytes and decode the the byte representation using the ascii codec:

np.array([98,97,99], dtype=np.int8).tostring().decode("ascii")

note that tostring is badly named, it actually returns bytes which happens to be a string in python2, in python3 you will get the bytes type back which need to be decoded.

edited Jul 19 '14 at 09:46

answered Jul 19 '14 at 09:40

jtaylor

2,389
19
19

score 4 · Answer 4 · answered Sep 20 '21 at 14:30

4

import numpy as np
np.array([97, 98, 99], dtype='b').tobytes().decode("ascii")

Output:

'abc'

Data type objects (dtype)

tostring() is deprecated since version 1.19.0. Use tobytes() instead.

answered Sep 20 '21 at 14:30

ivanbgd

171
1
5

score 1 · Answer 5 · edited Sep 12 '19 at 23:53

1

from numpy import array

a = array([97, 98, 99])
print("{0:c}{1:c}{2:c}".format(a[0], a[1], a[2]))

Of course, join and a list comprehension can be used here as well.

edited Sep 12 '19 at 23:53

Boris Verkhovskiy

14,854
11
100
103

answered Jul 19 '14 at 08:49

nouseforname

720
1
5
17

But this *only works* for `len(a) == 3`, which seems very fragile. – jonrsharpe Jul 19 '14 at 08:56
@jonrsharpe i shoud've mentioned that i just wanted to show the "format()" method. Which could be used inside a loop. – nouseforname Jul 19 '14 at 09:58

nth · Answer 6 · 2020-02-24T21:26:36.673

1

Solutions that rely on Python loops or string formatting will be slow for large datasets. If you know that all of your data are ASCII, a faster approach could be to use fancy indexing:

import numpy as np
a = np.array([97, 98, 99])
np.array([chr(x) for x in range(127)])[a]
# array(['a', 'b', 'c'], dtype='<U1')

An advantage is that it works for arbitrarily shaped arrays.

edited Feb 24 '20 at 21:26

answered Feb 24 '20 at 21:16

nth

1,442
15
12

Convert Numpy array of ASCII codes to string

6 Answers6

Linked