0

I have a list as below

data_corpus = ["John likes to watch movies",
 "Mary likes movies too", 
"John also likes to watch football games"]

I want to get

['John', 'likes', 'to', 'watch', 'movies', 'Mary', 'likes', 'movies', 'too',
 'John', 'also', 'likes', 'to', 'watch', 'football', 'games']

I do

from nltk.tokenize import word_tokenize
tokenized = [word_tokenize(i) for i in data_corpus]
tokenized

ang get list of sentences instead of list of words

[['John', 'likes', 'to', 'watch', 'movies'],
 ['Mary', 'likes', 'movies', 'too'],
 ['John', 'also', 'likes', 'to', 'watch', 'football', 'games']]

How to fix it?

Edward
  • 4,443
  • 16
  • 46
  • 81

0 Answers0