Csv file tokenization using Pandas Python

Question

I can't find any example coding about how to do tokenization with csv file using Pandas Python. Below is my code with the cleaned review. I am going to use this "cleaned_review" to perform tokenization.

enter image description here

score 0 · Answer 1 · answered Aug 12 '20 at 04:56

0

to perform tokenization on cleaned_review, apply the NLTK word tokenizer row-wise to the column.

import nltk
steam_reviews['cleaned_review'] = steam_reviews.apply(lambda row: nltk.word_tokenize(row['cleaned_review']), axis=1)

answered Aug 12 '20 at 04:56

thorntonc

2,046
1
8
20

Csv file tokenization using Pandas Python

1 Answers1