I'm struggling to convert an array into individual tokens. Currently I used the following code, but am not getting to the exact output that I want. As I would like the numbers to be part of it too.
text = df.head(3)[['processed_arti', 'cluster']].values // where df is a pandas dataframe
terms = [b for l in text for b in zip (l[0].split(" "))]
I've added another picture below showing a bit more detail of how the data looks. Read in a pandas dataframe.
I'd really appreciate any help on this. Thanks in advance.