Multilingual text (NLP) processing toolkit for Python
From the homepage:
Polyglot is a natural language pipeline that supports massive multilingual applications.
Features:
- Tokenization (165 Languages)
- Language detection (196 Languages)
- Named Entity Recognition (40 Languages)
- Part of Speech Tagging (16 Languages)
- Sentiment Analysis (136 Languages)
- Word Embeddings (137 Languages)
- Morphological analysis (135 Languages)
- Transliteration (69 Languages)