I have configured an enhancement chain in Stanbol that includes Tika, Language Detection, a custom Named Entity Recogniser and the OpenNLP Sentence Detector.

I am clearly getting the NER and lang-detect output, but the sentence detection engine doesn't seem to work.

Am I missing something here?

Enhancement Chain

NER and Lang-detect Output, Sentence detection output is not present

Geetanjali Agarwal

1 Answer

Instead of using opennlp-sentence you should use opennlp-token. Sentence detection does not help here because the data used to train the NER model is only tokenized (and, of course, labelled with the entities). The output will never show you how the input was tokenized (or segmented into sentences); it only shows the context from which each result was inferred.
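To try the change, something like the following sketch may help. It swaps the engine name in a list and builds (without sending) a request against the chain. It assumes a Stanbol launcher running at `http://localhost:8080` and the usual `/enhancer/chain/<name>` endpoint; the chain name `my-chain` and the engine names `custom-ner` and `langdetect` are placeholders I made up for illustration, so adjust them to your setup.

```python
from urllib.request import Request

# Assumption: a locally running Stanbol launcher on the default port.
STANBOL_BASE = "http://localhost:8080"


def swap_sentence_for_token(engines):
    """Return a copy of the engine list with opennlp-sentence replaced by opennlp-token."""
    return ["opennlp-token" if e == "opennlp-sentence" else e for e in engines]


def build_enhance_request(chain_name, text):
    """Build (but do not send) a POST request for the given enhancement chain."""
    return Request(
        url=f"{STANBOL_BASE}/enhancer/chain/{chain_name}",
        data=text.encode("utf-8"),
        headers={"Content-Type": "text/plain", "Accept": "text/turtle"},
        method="POST",
    )


# Hypothetical engine order; only the two opennlp-* names come from this answer.
engines = swap_sentence_for_token(
    ["tika", "langdetect", "opennlp-sentence", "custom-ner"]
)
request = build_enhance_request("my-chain", "Paris is the capital of France.")
```

Passing `request` to `urllib.request.urlopen` would send the text to the chain. The returned enhancements should still contain the NER and language annotations, with no separate token or sentence entries, as described above.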

Stephen Kennedy