Possible Duplicate:
How to get frequently occuring phrases with Lucene
I need to find most occuring words or word groups in an index which means most occuring text might be the word itself or a word group. Much similar to twitters trending topic (without hashtag entities ofcourse). Does Lucene provide some sort of method to do so or how can I achieve this in a massive data. If the question is unclear I can give examples to be more specific. I'm using java by the way and Lucene 3.5.
And a quick edit the "word group" can contain max 3 words. Let's say in a big text I have the word "is" 500 times "weather" 100 times "nice" 300 times and the word group "weather is nice" 90 times. I need to find if the occurence of "weather is nice" is important for me. And of course I need to look every indexed word...
Thank you.