I have multiple columns in a dataframe.
Some have duplicates, others are completely empty.
I need to delete the duplicates within each column.
dict_A = {'Firm_A': ['A', 'A', 'B','C'],'Firm_B': [], 'Firm_C': ['B', 'A', 'B','D'], 'Firm_D': ['C', 'C', 'A','D']}
df = pd.DataFrame.from_dict(dict_A, orient = 'index')
df = df.T
df
Firm_A Firm_B Firm_C Firm_D
0 A None C C
1 A None A C
2 B None B A
3 C None D A
Desired Output:
Firm_A Firm_B Firm_C Firm_D
0 A None C C
1 B None A A
2 C None B None
3 None None D None
So far I have tried drop.duplicates().
I have around 3000 columns with each column having around 100 values.