I want to see how relevant an incoming tweet is to my test article.I have a set of keyphrases in a string array extracted from the test article.I want to find a similarity score between this string array and incoming tweet (spark streaming) so that i could get some relevant tweets for the considered keywords.Please help me to find the similarity....
I have a String array str[a,b,c,....] I have an incoming tweet and I need to know how many strings from the array are present in the incoming tweet.The more the no.of strings matched,the more relevant is the tweet to my string of keyphrases taken.The tweet is an rdd and it has only the text in it.