
I have a pandas dataframe that looks something like this:

   Column1  Column2  Column3
0        1      NaN      NaN
1        4      NaN      NaN
2      NaN        3      NaN
3      NaN       98      NaN
4      NaN      NaN      562
5      NaN      NaN      742
...

How would I go about removing all of the unnecessary NaNs to make it look like this?

   Column1  Column2  Column3
0        1        3      562
1        4       98      742
...
M. Chak
  • What is the issue, exactly? Have you tried anything, done any research? Where does that data even come from, is there no way of fixing it beforehand? – AMC Apr 07 '20 at 19:00

3 Answers


Run:

df.apply(lambda col: col.dropna().reset_index(drop=True).astype(int))

This applies to each column a function that drops the NaN values in that column and renumbers the remaining values from 0. Because of the NaN values, the columns are generally of float type, so I also cast them back to int.
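For reference, here is a self-contained version of this answer, reconstructing the sample frame from the question:

import numpy as np
import pandas as pd

# Rebuild the sample frame from the question.
df = pd.DataFrame({
    "Column1": [1, 4, np.nan, np.nan, np.nan, np.nan],
    "Column2": [np.nan, np.nan, 3, 98, np.nan, np.nan],
    "Column3": [np.nan, np.nan, np.nan, np.nan, 562, 742],
})

# Drop NaNs per column, renumber from 0, and cast back to int.
result = df.apply(lambda col: col.dropna().reset_index(drop=True).astype(int))
print(result)
#    Column1  Column2  Column3
# 0        1        3      562
# 1        4       98      742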

Note also that the other solutions work only as long as each column contains an equal number of non-NaN values.

To check it, add the following row:

6  NaN   NaN   999

to your 6 initial rows, so that Column3 now contains 3 non-NaN values, whereas the other columns contain only 2.

The solution by yatu drops this last row, whereas the solution by Quang fails with ValueError: arrays must all be same length.

My solution, however, also works in this case, leaving a trailing NaN in the "too short" columns, as the demo below shows.
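Continuing the demo above, append the extra row and rerun the same expression:

# Add a seventh row where only Column3 has a value.
df.loc[6] = [np.nan, np.nan, 999]

print(df.apply(lambda col: col.dropna().reset_index(drop=True).astype(int)))
#    Column1  Column2  Column3
# 0      1.0      3.0      562
# 1      4.0     98.0      742
# 2      NaN      NaN      999

Note that the trailing NaN padding upcasts the shorter columns back to float, since an int column cannot hold NaN.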

Valdi_Bo

You can just dropna each column and return the underlying array; returning .values instead of a Series discards the index, so pandas stacks the non-NaN values from the top rather than aligning them by their original row labels:

df.apply(lambda x: x.dropna().values)

Output:

   Column1  Column2  Column3
0      1.0      3.0    562.0
1      4.0     98.0    742.0
Quang Hoang

We can use justify here from the linked post to push all non-NaN values to the top of each column (a sketch of this helper follows the output below):

pd.DataFrame(justify(df.values, invalid_val=np.nan, side='up', axis=0), 
             columns=df.columns).dropna()

   Column1  Column2  Column3
0      1.0      3.0    562.0
1      4.0     98.0    742.0
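Note that justify is not part of pandas or NumPy; it comes from the post linked above. Below is a sketch along the lines of the widely circulated NumPy justify recipe (the exact implementation in the linked post may differ):

import numpy as np

def justify(a, invalid_val=0, axis=1, side='left'):
    # Push all valid values in `a` to one side along `axis`,
    # filling the remaining slots with `invalid_val`.
    if invalid_val is np.nan:
        mask = ~np.isnan(a)
    else:
        mask = a != invalid_val
    # Sorting booleans moves True (valid) flags to the high-index side;
    # flip them back for 'up'/'left' justification.
    justified_mask = np.sort(mask, axis=axis)
    if (side == 'up') | (side == 'left'):
        justified_mask = np.flip(justified_mask, axis=axis)
    out = np.full(a.shape, invalid_val)
    if axis == 1:
        out[justified_mask] = a[mask]
    else:
        out.T[justified_mask.T] = a.T[mask.T]
    return out

With side='up' and axis=0, the non-NaN values in each column are shifted to the top, and the trailing all-NaN rows are then removed by dropna().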
yatu