Creating dictionary from dataframe avoiding repetition

Question

I have a two-column df with a particular distribution of items. Items in the first column are repeated; items in the second column are not.

I have been trying to create a dictionary in which the keys are the distinct items of the first column and the values are the corresponding items of the second column. Here are my table and the dictionary I would like to create, for a better understanding.

dict
{'A': '1', '2','3','4','9','C', 'B': '2', '3','4','29','34'}

Could someone put me in the right direction?

score 4 · Accepted Answer · answered Apr 03 '20 at 11:39

Close. What you need is a dictionary of lists, and the values have to be strings because of the 'C':

d = df.groupby('col1')['col2'].agg(list).to_dict()
print(d)
{'A': ['1', '2', '3', '4', '9', 'C'], 'B': ['2', '3', '4', '29', '34']}
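
For anyone who wants to run this end to end, here is a minimal sketch. The original table is not shown in the question, so the DataFrame below is an assumption reconstructed from the desired output, with the column names `col1` and `col2` taken from the snippet above.

```python
import pandas as pd

# Hypothetical sample data: the question's table is not shown, so these rows
# are reconstructed from the expected dictionary. Values are kept as strings
# because 'C' cannot be stored as a number.
df = pd.DataFrame({
    "col1": ["A"] * 6 + ["B"] * 5,
    "col2": ["1", "2", "3", "4", "9", "C", "2", "3", "4", "29", "34"],
})

# Group rows by the repeated key column, collect each group's col2 values
# into a list, then convert the resulting Series to a plain dict.
d = df.groupby("col1")["col2"].agg(list).to_dict()
print(d)
```

Each key appears once in the result, and the order of the values within each list follows the row order of the DataFrame.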

jezrael

score 1 · Answer 2 · answered Apr 03 '20 at 11:40

Try this:

new_dict = df.groupby('col1')['col2'].apply(list).to_dict()
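
As a quick check, `apply(list)` and `agg(list)` should give the same dictionary for this kind of data. The sketch below uses a small made-up DataFrame (the rows are an assumption, not the asker's table) to compare the two.

```python
import pandas as pd

# Hypothetical sample data, not the asker's actual table.
df = pd.DataFrame({
    "col1": ["A", "A", "A", "B", "B"],
    "col2": ["1", "2", "C", "2", "34"],
})

# Collect each group's values into a list with apply ...
by_apply = df.groupby("col1")["col2"].apply(list).to_dict()
# ... and with agg, for comparison.
by_agg = df.groupby("col1")["col2"].agg(list).to_dict()

print(by_apply)
print(by_apply == by_agg)
```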

Carsten
