
I have a CSV file with duplicates that appear only in the column named "Fichier". I wrote the following lines:

df = pd.read_csv(path_to_file, encoding='utf-8', sep=',')
df.drop_duplicates(subset="Fichier", keep='first', inplace=True)

But it doesn't work. I even tried to do it via Excel, but that doesn't work either.

Many thanks in advance!

  • You can look at [Python pandas remove duplicate columns](https://stackoverflow.com/questions/14984119/python-pandas-remove-duplicate-columns) – Hrushi Jun 16 '22 at 09:41

1 Answer


You can try this; it works for me:

# In my case
metadata = pd.read_csv('CSV/data_full.csv', low_memory=False)

# Build a Series of row indices keyed by the 'Fichier' column,
# then drop duplicated entries
myresult = pd.Series(metadata.index, index=metadata['Fichier']).drop_duplicates()
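
For comparison, here is a minimal self-contained sketch of the `drop_duplicates(subset=...)` approach from the question, run on a toy DataFrame (the column name `Fichier` and the sample data are assumptions for illustration). The key points are that the subset name must match the column exactly and that, without `inplace=True`, the result must be assigned back:

```python
import pandas as pd

# Toy frame with duplicate values only in the "Fichier" column
df = pd.DataFrame({
    "Fichier": ["a.txt", "b.txt", "a.txt", "c.txt"],
    "size": [1, 2, 3, 4],
})

# Keep the first row for each distinct "Fichier" value;
# drop_duplicates returns a new frame, so assign the result back
deduped = df.drop_duplicates(subset="Fichier", keep="first")

print(deduped["Fichier"].tolist())  # ['a.txt', 'b.txt', 'c.txt']
```

If the column name were misspelled (e.g. `"file"` instead of `"Fichier"`), `drop_duplicates` would raise a `KeyError` rather than silently doing nothing.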