Questions tagged [opennlp]

Apache's libraries for natural language processing (NLP).

The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks are usually required to build more advanced text processing services. OpenNLP also included maximum entropy and perceptron based machine learning.

More about Natural Language Processing :

Natural language processing (NLP) is the ability of a computer program to understand human speech as it is spoken.

Apache OpenNlp is often used with Apache Flink(a document query library).

Relevant Links,

http://searchcontentmanagement.techtarget.com/definition/natural-language-processing-NLP https://opennlp.apache.org/docs/.

Cornerstone books: https://www.manning.com/books/taming-text

684 questions
44
votes
4 answers

Entity Extraction/Recognition with free tools while feeding Lucene Index

I'm currently investigating the options to extract person names, locations, tech words and categories from text (a lot articles from the web) which will then feeded into a Lucene/ElasticSearch index. The additional information is then added as…
Karussell
  • 17,085
  • 16
  • 97
  • 197
26
votes
3 answers

How to detect that two sentences are similar?

I want to compute how similar two arbitrary sentences are to each other. For example: A mathematician found a solution to the problem. The problem was solved by a young mathematician. I can use a tagger, a stemmer, and a parser, but I don’t…
SahelSoft
  • 615
  • 2
  • 9
  • 22
24
votes
4 answers

Training own model in opennlp

I am finding it difficult to create my own model openNLP. Can any one tell me, how to own model. How the training shouls be done. What should be the input and where the output model file will get stored.
user1482228
  • 241
  • 1
  • 2
  • 3
23
votes
3 answers

Training n-gram NER with Stanford NLP

Recently I have been trying to train n-gram entities with Stanford Core NLP. I have followed the following tutorials - http://nlp.stanford.edu/software/crf-faq.shtml#b With this, I am able to specify only unigram tokens and the class it belongs to.…
18
votes
3 answers

How to use OpenNLP with Java?

I want to POStag an English sentence and do some processing. I would like to use openNLP. I have it installed When I execute the command I:\Workshop\Programming\nlp\opennlp-tools-1.5.0-bin\opennlp-tools-1.5.0>java -jar opennlp-tools-1.5.0.jar…
shababhsiddique
  • 904
  • 3
  • 14
  • 40
17
votes
1 answer

How to create a good NER training model in OpenNLP?

I just have started with OpenNLP. I need to create a simple training model to recognize name entities. Reading the doc here https://opennlp.apache.org/docs/1.8.0/apidocs/opennlp-tools/opennlp/tools/namefind I see this simple text to train the…
Dail
  • 4,622
  • 16
  • 74
  • 109
17
votes
3 answers

Accuracy: ANNIE vs Stanford NLP vs OpenNLP with UIMA

My work is planning on using a UIMA cluster to run documents through to extract named entities and what not. As I understand it, UIMA have very few NLP components packaged with it. I've been testing GATE for awhile now and am fairly comfortable…
Drag
  • 171
  • 1
  • 4
16
votes
4 answers

Sentence compression using NLP

Using Machine translation, can I obtain a very compressed version of a sentence, eg. I would really like to have a delicious tasty cup of coffee would be translated to I want coffee Does any of the NLP engines provide such a functionality? I got a…
Codevalley
  • 4,593
  • 7
  • 42
  • 56
16
votes
3 answers

opennlp chunker and postag results

Java - opennlp I am new to opennlp and i am try to analyze the sentence and have the post tag and chunk result but I could not understand the values meaning. Is there any table which can explain the post tag and chunk result values full form meaning…
d-man
  • 57,473
  • 85
  • 212
  • 296
15
votes
3 answers

OpenNLP vs Stanford CoreNLP

I've been doing a little comparison of these two packages and am not sure which direction to go in. What I am looking for briefly is: Named Entity Recognition (people, places, organizations and such). Gender identification. A decent training…
Mike Thomsen
  • 36,828
  • 10
  • 60
  • 83
15
votes
1 answer

How apache UIMA is different from Apache Opennlp

I have been doing some capability testing with Apache OpenNLP, Which has the capability to Sentence detection, Tokenization, Name entity recognition. Now when i started looking at UIMA documents it is mentioned on the UIMA home page - "language…
vashishth
  • 2,751
  • 4
  • 38
  • 68
13
votes
2 answers

Is there a way to get the subject of a sentence using OpenNLP?

Is there a way to get the subject of a sentence using OpenNLP? I'm trying to identify the most important part of a users sentence. Generally, users will be submitting sentences to our "engine" and we want to know exactly what the core topic is of…
rockit
  • 3,708
  • 7
  • 26
  • 36
13
votes
3 answers

How to extract the noun phrases using Open nlp's chunking parser

I am newbie to Natural Language processing.I need to extract the noun phrases from the text.So far i have used open nlp's chunking parser for parsing my text to get the Tree structure.But i am not able to extract the noun phrases from the tree…
user2024234
  • 217
  • 1
  • 4
  • 8
13
votes
3 answers

Extracting noun phrases from a text file using stanford typed parser

I have a text which I want to extract the noun phrases from it. I can easily get the typed parser for the text that i have, but wondering how i can extract the noun phrases in the text ?
S Gaber
  • 1,536
  • 7
  • 24
  • 43
12
votes
1 answer

RNTN implementation in java

I want to implement a Recursive neural tensor network(RNTN) in java. I've used Deeplearning4j for word2vec pipeline to vectorize a corpus of words. for NLP pipeline I've used Opennlp.( for tokenizing, POStaging and parsing) Now, I figured out that I…
iamgr007
  • 966
  • 1
  • 8
  • 28
1
2 3
45 46