Python, Pandas : Return only those rows which have missing values

Question

While working in Pandas in Python...

I'm working with a dataset that contains some missing values, and I'd like to return a dataframe which contains only those rows which have missing data. Is there a nice way to do this?

(My current method to do this is an inefficient "look to see what index isn't in the dataframe without the missing values, then make a df out of those indices.")

score 137 · Accepted Answer · edited Apr 13 '19 at 21:38

137

You can use any axis=1 to check for least one True per row, then filter with boolean indexing:

null_data = df[df.isnull().any(axis=1)]

edited Apr 13 '19 at 21:38

cs95

379,657
97
704
746

answered May 25 '15 at 23:18

metersk

11,803
21
63
100

5

`df.isnull()` returns DataFrame after 0.23. Use `df.isnull().values.any(axis=1)` is a bit faster. – user3226167 Jul 25 '19 at 02:14

score 4 · Answer 2 · edited Jun 03 '20 at 11:48

4

df.isnull().any(axis = 1).sum()

this gives you the total number of rows with at least one missing data

edited Jun 03 '20 at 11:48

David Buck

3,752
35
31
35

answered Jun 03 '20 at 09:59

Collins Kelechi

235
2
3

score 2 · Answer 3 · answered Aug 11 '20 at 18:40

2

If you want to see only the rows that contains the NaN values you could do:

data_frame[data_frame.iloc[:, insert column number here]=='NaN']

answered Aug 11 '20 at 18:40

João Vitor Gomes

317
3
12

score 0 · Answer 4 · edited Nov 05 '21 at 08:55

0

I just had this problem I assume you want to view a section of data frame made up of rows with missing values I used

df.loc[df.isnull().any(axis=1)]

edited Nov 05 '21 at 08:55

Vinson Ciawandy

996
11
26

answered Jul 12 '20 at 08:52

agravaine

1
2

score -1 · Answer 5 · answered Jan 04 '21 at 14:06

-1

You Can Use the code in this way

sum(df.isnull().any(axis=1))

answered Jan 04 '21 at 14:06

Ahmed Khater

7
1

score -3 · Answer 6 · answered Apr 09 '20 at 17:18

-3

If you are looking for a quicker way to find the total number of missing rows in the dataframe, you can use this:

sum(df.isnull().values.any(axis=1))

answered Apr 09 '20 at 17:18

Ikay

1

Python, Pandas : Return only those rows which have missing values

6 Answers6

Linked