I have installed the NLTK library on two computers, in one of them is working fairly good (It processes about 1000 sentences in about 1 minute), and in my other computer it takes 1 minute for 10 sentences.
Saying that my second computer is faster, so it has nothing to do with my second computer.
This is the way I have installed it:
pip install nltk
then, I run python
In the python terminal: import nltk
then, ntlk.download()
It says that I have some of the all-corpora packages out of date (I don't know why) but it seems it is only this one: PanLex Lite Corpus
, which I think has nothing do do with my problem.... and this other not installed: Cross-Framework and Cross-Domain Parser Evaluation Shared Task
. I don't know if it could have something do do...
Those are the modules I am using:
from nltk import pos_tag
from nltk import word_tokenize
from nltk.stem.wordnet import WordNetLemmatizer
And they work terribly slow...
Does anyone know why and know how to solve it?