Pandas fill cells with NaN when conditions are met

Question

import pandas as pd
import numpy as np
df = pd.DataFrame([[1,2,3],[4,5,6]], columns=['a','b','c'])
df[df['c'] == 3]['a'] = np.nan
df
>>>
    a   b   c
0   1   2   3
1   4   5   6

Why is 0,a not NaN? I would expect that the value in 0,a would have been replaced by np.nan.

jezrael · Accepted Answer · 2022-01-18T10:17:13.300

1

Problem is you create chained indexing:

df[df['c'] == 3]

with:

df[df['c'] == 3]['a']

So after set values is not propagate new value to original DataFrame.

For avoid it use DataFrame.loc:

df.loc[df['c'] == 3, 'a'] = np.nan
print (df)
     a  b  c
0  NaN  2  3
1  4.0  5  6

More information.

edited Jan 18 '22 at 10:17

answered Jan 18 '22 at 10:09

jezrael

822,522
95
1,334
1,252

Pandas fill cells with NaN when conditions are met

1 Answers1