I have a list of frequent terms obtained from different search queries of a certain user.
For example:
sports, badminton, soccer
soccer, sports
research, AI, algorithm
research, adaptive, personalized search
research, AI, neuro network
The goal here is to build a hierarchical user profile based on these frequent terms with the hypothesis that terms that frequently appear in such queries represent topics that interest the user. In the hierarchy, general terms with higher frequency are placed at higher levels, while specific terms with lower frequency are placed at lower levels of the hierarchic user profile.
The expected result would be a tree of the user profile looking somehow like this :
- User profile
- research
- AI
- algorithm
- personalized search
- ..
- sports
- soccer
- badminton
- research
So, I'm currently stuck and I don't even know how to begin or what libraries to use. (I'm working with java on this project).
I would be very grateful if somebody could provide me with some help.
Thanks in advance.