3

In pandas I am using drop_duplicates(subset='Full Name', inplace=True) to remove duplicate records, however I would like to view the removed duplicate records for manual inspection. How can I do this?

Striezel
  • 3,693
  • 7
  • 23
  • 37
LCott
  • 43
  • 1
  • 5

1 Answers1

2

Use duplicated and parameter keep=False for return all dupes with boolean indexing for filtering:

df = pd.DataFrame({'Full Name':list('abcdac'),
                   'Some val':[4,5,4,5,5,4]})

print (df)
  Full Name  Some val
0         a         4
1         b         5
2         c         4
3         d         5
4         a         5
5         c         4

df = df[df.duplicated(subset='Full Name', keep=False)]
print (df)
  Full Name  Some val
0         a         4
2         c         4
4         a         5
5         c         4
jezrael
  • 822,522
  • 95
  • 1,334
  • 1,252