Remove all rows which each value is the same

Question

I want to drop all rows that have same values by drop_duplicates(subset=['other_things','Dist_1','Dist_2']) but could not get it.

Input

  id  other_things  Dist_1  Dist_2
    1   a             a       a
    2   a             b       a
    3   10            10      10
    4   a             b       a
    5   8             12      48
    6   8             12      48

Expeted

  id  other_things  Dist_1  Dist_2
    2   a             b       a
    4   a             b       a
    5   8             12      48
    6   8             12      48

Try

df =  df.drop_duplicates()

please specify which columns to consider for dropping duplictes — iamklaus, Mar 01 '19 at 12:19
Use `df[df.duplicated(subset=['other_things','Dist_1','Dist_12'], keep=False)]` — jezrael, Mar 01 '19 at 12:20

score 0 · Answer 1 · answered Mar 01 '19 at 12:18

0

It looks like the 'id' column could be generating problems.

Would recommend using the 'subset' parameter on drop duplicates as per the documentation.

drop_duplicates documentation1

answered Mar 01 '19 at 12:18

ecortazar

1,382
1
6
12

Remove all rows which each value is the same

Input

Expeted

Try

1 Answers1