LingPipe is tool kit for processing text using computational linguistics.
Overview
LingPipe is Java based Natural Language Processing toolkit for text mining and analysis. LingPipe can be used to do tasks like:
- Named Entity Recognition
- Sentiment Analysis on Twitter Feeds
- Suggest spelling corrections
Architecture Overview
LingPipe's architecture is designed to be efficient, scalable, reusable, and robust. Highlights include:
- Java API with source code and unit tests;
- multi-lingual, multi-domain, multi-genre models;
- training with new data for new tasks;
- n-best output with statistical confidence estimates;
- online training (learn-a-little, tag-a-little);
- thread-safe models and decoders for concurrent-read exclusive-write (CREW) synchronization; and
- character encoding-sensitive I/O.
Licence
LingPipe is available under various licences ranging from free to commercial. These include:
- Affero General Public License (AGPL) (Non-Commercial)
- Developer (Commercial)
- Start Up (Commercial)
- Enterprise Server (Commercial)