I am working on a data set . It's first row looks like this:
Column1 Column2
1 [food=3, party=2,....] 2 [ocean=2, fish=3, surf=2,....] . .................. . .................. . .................. (Column1 has serial numbers and Column2 has the list of words with frequencies.)
Each row has multiple words with their respective frequencies.
I would like to convert the column2 as follows:
[food, food, food, party, party.....] and so on.
I am finding it difficult and don't know where to start. I tried tokenizing, but don't know how to proceed.