I've been searching for a while now, but haven't found anything that fits my needs so far. (This was helpful, but not convincing.)
I get two strings from two different sources and want to check whether the shorter one is contained in the longer one. However, since both strings originate from OCR'd documents, they may differ in places because of recognition errors, so an exact substring match will not work.
Example:
String textToSearch = "Recognized Headline";
String documentText = "This is the document text, spanning multiple pages" +
"..." +
"..." +
"This the row with my Recognizect Head1ine embedded" +
//                    ^^^^^^^^^^^^^^^^^^^^
"..." +
"..." +
"End of the document";
How can I reliably find my string in the document without running a standalone Lucene/Solr installation? (Or maybe I just haven't found the right tutorial or manual.) There must be a library out there that can do this, right?
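To make clear what kind of matching I mean, here is a minimal sketch of approximate substring search: a Levenshtein-style dynamic program in which a match may start anywhere in the text (sometimes called Sellers' algorithm). It is plain Java, not taken from any library. The class name `FuzzyContains`, the method names, and the `maxErrors` threshold are my own assumptions for illustration.

```java
class FuzzyContains {

    /** Smallest edit distance between the pattern and any substring of the text. */
    static int bestSubstringDistance(String pattern, String text) {
        int m = pattern.length();
        int[] prev = new int[m + 1]; // distances for the previous text position
        int[] curr = new int[m + 1]; // distances for the current text position

        // Before any text is consumed, matching i pattern chars costs i deletions.
        for (int i = 0; i <= m; i++) {
            prev[i] = i;
        }
        int best = prev[m];

        for (int j = 1; j <= text.length(); j++) {
            curr[0] = 0; // a match may start at any position, so no cost here
            for (int i = 1; i <= m; i++) {
                int cost = pattern.charAt(i - 1) == text.charAt(j - 1) ? 0 : 1;
                int substitute = prev[i - 1] + cost; // match or replace a char
                int skipText = prev[i] + 1;          // extra char in the text
                int skipPattern = curr[i - 1] + 1;   // char missing from the text
                curr[i] = Math.min(substitute, Math.min(skipText, skipPattern));
            }
            best = Math.min(best, curr[m]); // best match ending at position j
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return best;
    }

    /** True if some substring of text is within maxErrors edits of pattern. */
    static boolean fuzzyContains(String pattern, String text, int maxErrors) {
        return bestSubstringDistance(pattern, text) <= maxErrors;
    }

    public static void main(String[] args) {
        String textToSearch = "Recognized Headline";
        String row = "This the row with my Recognizect Head1ine embedded";
        System.out.println(fuzzyContains(textToSearch, row, 3)); // prints true
    }
}
```

In the example above, three edits are enough ("d" to "ct" counts as two, "l" to "1" as one). This runs in time proportional to pattern length times text length, which is probably fine for a single document but may be too slow for large collections, and picking a sensible `maxErrors` relative to the pattern length is left open.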