Say I have this text = I love apples, kiwis, oranges and bananas
and the searchString = kiwis and bananas
and a similarity algorithm, say the Jaccard index. How can I efficiently find the substring in text
which has the highest similarity to searchString?
Basically, I am trying to find portions of the text that match a list of keywords I have. The text is noisy: it has many errors, misspellings, extra symbols and spaces.
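
To make the problem concrete, here is a minimal brute-force sketch of the search being asked about. It is a baseline, not an efficient solution. It assumes the Jaccard index is computed over sets of lowercased words, and it only tries windows whose length is close to that of searchString. The names `tokens`, `jaccard`, `best_window` and the `max_extra` parameter are hypothetical, chosen for illustration only.

```python
import re

def tokens(s):
    # Lowercase and keep only alphanumeric runs, so commas and
    # extra spaces are ignored (an assumption about the cleanup).
    return re.findall(r"[a-z0-9]+", s.lower())

def jaccard(a, b):
    # Jaccard index of two token lists, treated as sets.
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def best_window(text, search, max_extra=2):
    # Try every word window whose length is within max_extra
    # of the search string's length and keep the best score.
    words = tokens(text)
    target = tokens(search)
    best = (0.0, "")
    smallest = max(1, len(target) - max_extra)
    largest = len(target) + max_extra
    for size in range(smallest, largest + 1):
        for start in range(len(words) - size + 1):
            window = words[start:start + size]
            score = jaccard(window, target)
            if score > best[0]:
                best = (score, " ".join(window))
    return best

text = "I love apples, kiwis, oranges and bananas"
search_string = "kiwis and bananas"
print(best_window(text, search_string))
# prints (0.75, 'kiwis oranges and bananas')
```

This compares whole words, so it does not tolerate misspellings. Swapping `tokens` for a function that returns character n-grams would be one way to handle that, at the cost of larger sets per window.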