Questions tagged [any23]

Anything To Triples (any23) is a library, a web service and a command line tool, written in Java, that extracts structured data in RDF format from a variety of Web documents.

Anything To Triples (any23) is a library, a web service and a command line tool, written in Java, that extracts structured data in RDF format from a variety of Web documents.

external reference, see project website

4 questions
8
votes
1 answer

Extract JSON-LD from HTML using Apache Any23

My aim is to extract structured data from webpages. I'm using the code mentioned in this SO question. I'm using Apache Any23 CLI library dependency in my Spring project. By using this, I'm able to extract the HTML5 Microdata (Schema.org) from…
Abhishek Ramachandran
  • 1,160
  • 1
  • 13
  • 34
3
votes
0 answers

Microdata Extraction with MicrodataExtractor of Any23

I am trying to understand microdata extraction with Any23, but can't find much of useful documentation. Can anybody give me a simple example of how to use MicrodataExtractor to extract microdata from HTML pages into RDF document in java?
Sherzad
  • 31
  • 2
2
votes
1 answer

How to add Apache Any23 RDF Statements to Apache Jena?

Basically, I use the Any23 distiller to extract RDF statements from files embedded with RDFa (The actual files where created by DBpedia Spotlight using the xhtml+xml output option). By using Any23 RDFa distiller I can extract the RDF statements (I…
user1479847
  • 107
  • 11
1
vote
3 answers

Parse json-ld generated by Apache Any23 into Java Pojo using Jackson

I want to map structured data (microdata, jsonld) extracted from html text into a Java POJO. For extraction I use the library Apache Any23 and configured a JSONLDWriter to convert the structured data found in the html document into json-ld…
wero026
  • 1,187
  • 2
  • 11
  • 23