Loading large, compressed RDF file into Jena TDB2 without decompression

Question

I am trying to load a large RDF file (some hundred million triples) into a Jena TDB2 database. Fuseki with Graph Store Protocol does not work because the file is too large for clients. The file is compressed NTriples, roughly 20 times smaller then uncompressed NTriples file. Is it possible to load the data while uncompressing it on-the-fly? I tried with a named pipe but this does not work:

$ tdb2.tdbloader --loc $DB <(zcat rdf.nt.gz)
Can't read file : /dev/fd/63

Jena knows how to decompress .gz files: `tdb2.tdbloader --loc $DB rdf.nt.gz`. Otherwise the argument are `tdbloader ... -- -`, two minus to end the flags and one for stdin. — AndyS, Mar 10 '23 at 17:43

score 0 · Accepted Answer · answered Mar 14 '23 at 10:05

0

As mentioned by AndyS, compressed files can directly be passed to Jena TDB2 command line tools, so the this example works:

$ tdb2.tdbloader --loc $DB rdf.nt.gz

answered Mar 14 '23 at 10:05

Jakob

3,570
3
36
49

score -2 · Answer 2 · answered Mar 14 '23 at 10:29

-2

$ tdb2.tdbloader --loc $DB rdf.nt.gz

answered Mar 14 '23 at 10:29

Dev.Abdullah_jaber

1
1

1

As it’s currently written, your answer is unclear. Please [edit] to add additional details that will help others understand how this addresses the question asked. You can find more information on how to write good answers [in the help center](/help/how-to-answer). – Community Mar 19 '23 at 11:46

Loading large, compressed RDF file into Jena TDB2 without decompression

2 Answers2