here is data example:
import pandas as pd
df = pd.DataFrame({
'file': ['file1','file2','file1','file2','file3','file3','file4','file5','file4','file5'],
'prop1': ['True','False','True','False','False','False','False','True','False','False'],
'prop2': ['False','False','False','False','True','False','True','False','True','False'],
'prop3': ['False','True','False','True','False','True','False','False','False','True']
})
file prop1 prop2 prop3
0 file1 True False False
1 file2 False False True
2 file1 True False False
3 file2 False False True
4 file3 False True False
5 file3 False False True
6 file4 False True False
7 file5 True False False
8 file4 False True False
9 file5 False False True
I need to drop duplicated rows with same props values to another dataframe and cut them off original file.
So another dataframe should looks like this (duplicated rows should not repeat):
file prop1 prop2 prop3
0 file1 True False False
3 file2 False False True
8 file4 False True False
df = df.drop_duplicates() drop onlu 1 duplicated row, but not second like this:
file prop1 prop2 prop3
0 file1 True False False
1 file2 False False True
4 file3 False True False
5 file3 False False True
6 file4 False True False
7 file5 True False False
9 file5 False False True