
I am using Wikidata enriched with other data sources, and I need to ingest the entire Wikidata JSON dump into a dev graph database of mine.

That part is easy. Once it is done, I want to keep my copy up to date by querying the RecentChanges and LogEvents API endpoints to retrieve the changes, deletions, and creations that occurred between two timestamps (I would do so every few minutes), and that is relatively easy too!
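For context, here is a minimal sketch of the polling step I have in mind, using only the Python standard library. It assumes the standard MediaWiki `list=recentchanges` query with `rcdir=newer` (so `rcstart` is the older bound and `rcend` the newer one). The function names, the `User-Agent` string, and the injectable `fetch` parameter are my own illustrative choices, not part of any Wikidata tooling. The LogEvents query for deletions would follow the same pattern.

```python
import json
import urllib.parse
import urllib.request

API_URL = "https://www.wikidata.org/w/api.php"


def recent_changes_params(start, end, limit=500):
    """Build query parameters for changes between two ISO 8601 timestamps."""
    return {
        "action": "query",
        "list": "recentchanges",
        "rcstart": start,          # older bound, because rcdir is "newer"
        "rcend": end,              # newer bound
        "rcdir": "newer",          # list results oldest first
        "rcprop": "title|ids|timestamp",
        "rclimit": str(limit),
        "format": "json",
    }


def http_get_json(params):
    """Perform one GET request against the API and decode the JSON body."""
    url = API_URL + "?" + urllib.parse.urlencode(params)
    # Hypothetical User-Agent value; replace with your own contact details.
    request = urllib.request.Request(
        url, headers={"User-Agent": "dump-sync-sketch/0.1 (example)"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def iter_recent_changes(start, end, fetch=http_get_json):
    """Yield every change in the window, following API continuation."""
    params = recent_changes_params(start, end)
    while True:
        data = fetch(params)
        yield from data.get("query", {}).get("recentchanges", [])
        continuation = data.get("continue")
        if not continuation:
            break
        # Merge the continuation tokens into the next request.
        params = {**params, **continuation}
```

Calling `iter_recent_changes("2018-11-13T16:00:00Z", "2018-11-13T16:05:00Z")` would walk one five-minute window. The open problem is which `start` to use for the very first window after loading a dump.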

What I cannot get my head around is...

How do I get the cutoff timestamp for a given JSON dump? Where is it available, or how can I work it out, given that the modified timestamp and lastrevid fields aren't present in the JSON dumps?

Lazhar
It looks like this is not currently possible. Some of the dumps have structured data about them (example: https://dumps.wikimedia.org/wikidatawiki/20181020/dumpstatus.json), but the JSON entities dump does not. I have created the following task to track this: https://phabricator.wikimedia.org/T209390 – Addshore Nov 13 '18 at 16:37
