The title speaks for itself. This is a code excerpt using the spaCy NLP framework.
with open("text.txt") as sentences:
    docs = list(nlp.pipe(sentences.readlines()))
I tried using this package, but it didn't seem to support one-liners in the way that I would like.
The end goal is to get a reasonable ETA for how long it will take to tokenize a very large amount of data.
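To make the goal concrete, here is a rough hand-written sketch of the behaviour I'm after, using only the standard library. The helper name `with_eta` and its parameters are made up for illustration (they are not part of spaCy or any package), and it assumes the total number of items is known up front:

```python
import sys
import time


def with_eta(items, total, every=1000, out=sys.stderr, clock=time.monotonic):
    """Yield items unchanged, periodically printing progress and a naive ETA.

    Hypothetical helper for illustration only; assumes `total` is known.
    """
    start = clock()
    for done, item in enumerate(items, start=1):
        yield item
        # Report every `every` items, and once more at the end.
        if done % every == 0 or done == total:
            elapsed = clock() - start
            rate = done / elapsed if elapsed > 0 else float("inf")
            # Seconds left, extrapolated from the average rate so far.
            remaining = max(total - done, 0) / rate
            print(f"{done}/{total} done, ~{remaining:.0f}s left", file=out)
```

Ideally the excerpt above would stay close to a one-liner, something like `docs = list(with_eta(nlp.pipe(lines), total=len(lines)))`, where `lines` is the list returned by `sentences.readlines()`.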
How can this be accomplished?