I have a list of lists in the form:
[['alice', 'in', 'wonderland', ...], ['the', 'final', 'showdown', ...], ...]
where each inner list holds the word tokens of one document (which I have already preprocessed).
I want to build a term frequency (TF) and inverse document frequency (IDF) matrix from this, but I'm not sure how to go about it.
I'm thinking of storing the data in a pandas DataFrame, but I'm not sure how to iterate over it to compute TF and IDF. (I know nltk might have some tools for this.)
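To make the goal concrete, here is a rough sketch in plain Python of what I think the computation looks like. It assumes raw counts for TF and the unsmoothed log(N / df) for IDF, which may not be the variant I should be using. The toy `docs` contents and the names `vocab`, `tf`, `df`, `idf` and `tfidf` are placeholders I made up for illustration.

```python
import math
from collections import Counter

# Toy stand-in for the real token lists (shortened, made-up contents).
docs = [
    ["alice", "in", "wonderland", "alice"],
    ["the", "final", "showdown"],
    ["alice", "and", "the", "cat"],
]

# Sorted vocabulary gives a fixed column order for the matrix.
vocab = sorted({token for doc in docs for token in doc})

# Term frequency: one row per document, raw count of each vocabulary term.
counts = [Counter(doc) for doc in docs]
tf = [[c[term] for term in vocab] for c in counts]

# Document frequency: number of documents that contain each term.
df = {term: sum(1 for c in counts if term in c) for term in vocab}

# Inverse document frequency, assuming the unsmoothed form log(N / df).
n_docs = len(docs)
idf = {term: math.log(n_docs / df[term]) for term in vocab}

# TF-IDF: scale each raw count by the term's IDF.
tfidf = [[count * idf[term] for term, count in zip(vocab, row)] for row in tf]
```

If the DataFrame route makes sense, I assume something like `pd.DataFrame(tfidf, columns=vocab)` would wrap this with one row per document, but I don't know whether that is the idiomatic way to do it.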
Any help would be appreciated!