In pandas
I am using drop_duplicates(subset='Full Name', inplace=True)
to remove duplicate records, however I would like to view the removed duplicate records for manual inspection. How can I do this?
Asked
Active
Viewed 187 times
3
1 Answers
2
Use duplicated
and parameter keep=False
for return all dupes with boolean indexing
for filtering:
df = pd.DataFrame({'Full Name':list('abcdac'),
'Some val':[4,5,4,5,5,4]})
print (df)
Full Name Some val
0 a 4
1 b 5
2 c 4
3 d 5
4 a 5
5 c 4
df = df[df.duplicated(subset='Full Name', keep=False)]
print (df)
Full Name Some val
0 a 4
2 c 4
4 a 5
5 c 4

jezrael
- 822,522
- 95
- 1,334
- 1,252