I have a list of wikipedia users and the articles that they edited.
I'm trying to build a hierarchical profile for each one of them.
The problem is i'm struggling to get parent categories for each article.
What I want is for example is for an article about Pizza to get "dishes" or "food".
I'm using jena and Yago and a simple SPARQL query who looks like this:
String sparqlQueryString = "BASE <http://yago-knowledge.org/resource/>"
+ "PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> "
+ "SELECT ?supercat WHERE { "
+ "<" + child + ">" + " rdf:type ?supercat . "
+ "}";
(Where child here is the article)
So i wanted to ask if somebody knew how to get a correct parent category from that. And also, if it's possible or if anyone has ideas about ordering all the titles of the articles and their parents into a hierarchic user interests profile.