I have a pandas DataFrame in a transactional format (one row per purchased item):
id purchased_item
1 apple
1 banana
1 carrot
2 banana
3 apple
4 apple
4 carrot
4 diet coke
5 banana
5 carrot
6 banana
6 carrot
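For reproducibility, here is a minimal sketch that builds the frame shown above. The variable name df matches the one used in my attempt; the integer dtype for id is an assumption.

```python
import pandas as pd

# Transactional data: one row per (id, purchased_item) pair.
# Assumption: id is stored as an integer column.
df = pd.DataFrame({
    "id": [1, 1, 1, 2, 3, 4, 4, 4, 5, 5, 6, 6],
    "purchased_item": [
        "apple", "banana", "carrot",
        "banana",
        "apple",
        "apple", "carrot", "diet coke",
        "banana", "carrot",
        "banana", "carrot",
    ],
})
```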
I would like to convert this to the following list of lists, with one inner list per id:
[['apple', 'banana', 'carrot'],
['banana'],
['apple'],
['apple', 'carrot', 'diet coke'],
['banana', 'carrot'],
['banana', 'carrot']]
I have tried this:
df.groupby(['id'])['purchased_item'].apply(list)
The output looks like:
id
1 [apple, banana, carrot]
2 [banana]
3 [apple]
4 [apple, carrot, diet coke]
5 [banana, carrot]
6 [banana, carrot]
How do I get from this Series to a plain list of lists? Or is there a different approach altogether? Thanks a lot for any help.
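One possible next step, as a sketch rather than a definitive answer: since the grouped result is a pandas Series whose values are Python lists, calling Series.tolist() on it should give the nested list directly. The names grouped and baskets are only illustrative, and the frame is rebuilt here so the snippet runs on its own.

```python
import pandas as pd

# Same sample data as in the question (id assumed to be an integer column).
df = pd.DataFrame({
    "id": [1, 1, 1, 2, 3, 4, 4, 4, 5, 5, 6, 6],
    "purchased_item": [
        "apple", "banana", "carrot", "banana", "apple",
        "apple", "carrot", "diet coke",
        "banana", "carrot", "banana", "carrot",
    ],
})

# Step already tried: one list of items per id (a Series indexed by id).
grouped = df.groupby("id")["purchased_item"].apply(list)

# Candidate next step: drop the index and keep only the lists.
baskets = grouped.tolist()
print(baskets)
```

groupby sorts the group keys by default, so the inner lists come out in ascending id order; passing sort=False would keep the order in which ids first appear instead.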