how to drop a row of a dataframe based on the sum of three columns

Question

suppose I have a dataframe in the form of:

my goal is to remove the rows where 100 minus the sum of these three columns is greater than 0.5.

how can I do that?

Here are a few ways for you to filter out the rows from a [dataframe](https://www.listendata.com/2019/07/how-to-filter-pandas-dataframe.html), and on [Stack Overflow](https://stackoverflow.com/questions/11869910/pandas-filter-rows-of-dataframe-with-operator-chaining) — Joe Ferndz, Mar 30 '21 at 21:51
Does this answer your question? [How to select rows from a DataFrame based on column values](https://stackoverflow.com/questions/17071871/how-to-select-rows-from-a-dataframe-based-on-column-values) — Henry Ecker, Mar 30 '21 at 21:54
I have added my solution as an answer. please check it out and see if it is a valid method. — Alirezaro93, Mar 30 '21 at 22:01

ashkangh · Answer 1 · 2021-03-30T22:01:16.760

4

One solution:

(100 - df.sum(1)).le(0.5)

This is a boolean mask that you can give it to a dataframe:

df[(100 - df.sum(1)).le(0.5)]

edited Mar 30 '21 at 22:01

answered Mar 30 '21 at 21:56

ashkangh

1

Note the requirement is to remove the rows where ..... – SeaBean Mar 30 '21 at 21:59
Thanks for letting me know. I rectified it. – ashkangh Mar 30 '21 at 22:01

score 0 · Accepted Answer · answered Mar 30 '21 at 21:59

0

Well here is what I came up with and it works just fine:

indexNames = df[((100 - df1['a'] - df['b'] - df['c']).abs() > 0.5)].index

df.drop(indexNames , inplace=True)

answered Mar 30 '21 at 21:59

Alirezaro93

in the method that I have proposed, it can take care of any dataframe and it also considers absolute vale of the result so it would be more robust – Alirezaro93 Mar 30 '21 at 22:04

2 Answers2