How to use numpy with 'None' value in Python?

Question

I'd like to calculate the mean of an array in Python in this form:

Matrice = [1, 2, None]

I'd just like to have my None value ignored by the numpy.mean calculation but I can't figure out how to do it.

+1: this question can be particularly relevant for arrays that are imported from a database, where values can sometimes be NULL. — Eric O. Lebigot, Nov 22 '11 at 22:30

tom10 · Accepted Answer · 2021-01-28T16:20:39.087

12

You are looking for masked arrays. Here's an example.

import numpy.ma as ma
a = ma.array([1, 2, None], mask = [0, 0, 1])
print "average =", ma.average(a)

From the numpy docs linked above, "The numpy.ma module provides a nearly work-alike replacement for numpy that supports data arrays with masks."

edited Jan 28 '21 at 16:20

answered Jun 07 '09 at 18:10

tom10

3

a member function that helped a lot was `filled`. that brought the masked array back to a normal array, filled with a value that I would recognize as invalid (NaN, -9999, whatever your users need). – mariotomo Apr 22 '10 at 09:20
1

Performance of masked arrays is also significantly less than regular numpy arrays as the implementation is pure Python. If you are dealing with big data, be aware of the performance implications. – timbo Dec 03 '14 at 23:37
Better to use numpy.nanmean than looking for ad-hoc solutions outside of numpy; see answer below. – strangeloop Jan 25 '21 at 16:42
Masked arrays are not ad-hoc nor outside of numpy. The docs link in my answer shows this. – tom10 Jan 28 '21 at 16:17

score 7 · Answer 2 · answered Jun 07 '09 at 17:28

7

haven't used numpy, but in standard python you can filter out None using list comprehensions or the filter function

>>> [i for i in [1, 2, None] if i != None]
[1, 2]
>>> filter(lambda x: x != None, [1, 2, None])
[1, 2]

and then average the result to ignore the None

answered Jun 07 '09 at 17:28

cobbal

5

`x != None` is usually written `x is not None` (PEP 8: "Comparisons to singletons like None should always be done with 'is' or 'is not', never the equality operators.") – Eric O. Lebigot Nov 22 '11 at 22:27

score 6 · Answer 3 · answered Nov 22 '11 at 22:15

6

You can use scipy for that:

import scipy.stats.stats as st
m=st.nanmean(vec)

answered Nov 22 '11 at 22:15

Noam Peled

2

This doesn't work. `a = [1,2,None]` and then `st.nanmean(a)` results in a TypeError. – Nate Jun 26 '13 at 20:58
2

Yes, you are right, it works on numpy.nan, not on None. It's most useful when calculating the mean on numpy vector. – Noam Peled Jun 30 '13 at 15:18
4

Now you can use also numpy.nanmean – Noam Peled Dec 11 '15 at 03:10

endolith · Answer 4 · 2011-06-16T16:07:41.900

4

You might also be able to kludge with values like NaN or Inf.

In [1]: array([1, 2, None])
Out[1]: array([1, 2, None], dtype=object)

In [2]: array([1, 2, NaN])
Out[2]: array([  1.,   2.,  NaN])

Actually, it might not even be a kludge. Wikipedia says:

NaNs may be used to represent missing values in computations.

Actually, this doesn't work for the mean() function, though, so nevermind. :)

In [20]: mean([1, 2, NaN])
Out[20]: nan

edited Jun 16 '11 at 16:07

answered Dec 06 '09 at 02:26

endolith

6

Actually, `mean(a[~isnan(a)])` explicitly choosing all non-NaN values works. – u0b34a0f6ae Dec 07 '09 at 14:27

score 3 · Answer 5 · answered Dec 06 '09 at 02:30

3

You can also use filter, pass None to it, it will filter non True objects, also 0, :D So, use it when you dont need 0 too.

>>> filter(None,[1, 2, None])
[1, 2]

answered Dec 06 '09 at 02:30

YOU

score 3 · Answer 6 · answered Jul 30 '19 at 21:25

You can 'upcast' the array to numpy's float64 dtype and then use numpy's nanmean method as in the following example:

import numpy as np

arr = [1,2,3, None]
arr2 = np.array(arr, dtype=np.float64)
print(arr2) # [ 1.  2.  3. nan]
print(np.nanmean(arr2)) # 2.0

score -1 · Answer 7 · answered Jun 06 '18 at 19:30

-1

np.mean(Matrice[Matrice != None])

answered Jun 06 '18 at 19:30

Ishan Tomar

7 Answers7