pandas - updating a dataframe column with value of another column with if condition

Question

I want to update ColumnA if it have "0" and string "pandas" in ColumnC with the mean value [which i stored in columnB]

df['ColumnA'] = df.apply(lambda x: x['ColumnB'] if (x['ColumnA']==0 & x['ColumnC']=='pandas') else x['ColumnA'], axis=1)

I am getting this error

unsupported operand type(s) for &: 'int' and 'str'

kindly advise how i can fix it

Use `and` instead of `&`: `(x['ColumnA']==0 and x['ColumnC']=='pandas')` `&` is for pandas Series but since you are applying through rows `x['ColumnA']` is a python scalar, so you can't use `&`. — Psidom, Jun 22 '21 at 23:45

Michael Delgado · Answer 1 · 2021-06-22T23:50:39.170

Put parentheses around your conditions, and as pointed out in the comments, use and instead of & for scalar comparison, e.g.

((x['ColumnA'] == 0) and (x['ColumnC'] == 'pandas'))

See this question on order of operations - the bitwise operator & takes precedence over the boolean operator ==

That said, you should consider using a vectorized operation:

df['ColumnA'] = df['ColumnB'].where(
    ((df['ColumnA'] == 0) & (df['ColumnC'] == 'pandas')),
    df['ColumnA'],
)

This will be faster than df.apply in nearly all cases.

score 0 · Answer 2 · answered Jun 22 '21 at 23:44

0

Use and instead of & and also put brackets around the == tests:

df['ColumnA'] = df.apply(lambda x: x['ColumnB'] if (x['ColumnA']==0) and (x['ColumnC']=='pandas') else x['ColumnA'], axis=1)

answered Jun 22 '21 at 23:44

SeaBean

2 Answers2