Questions tagged [data-dump]

57 questions
78
votes
14 answers

How to restore the dump into your running mongodb

I want to load data/restore dump data in mongoDB using mongorestore. I am trying to command mongorestore dump but it giving me error Sat Sep 21 16:12:33.403 JavaScript execution failed: SyntaxError: Unexpected identifier How can we restore or put…
Sandeep Singh
  • 7,790
  • 4
  • 43
  • 68
32
votes
6 answers

How do you sort the output of Data::Dumper?

I want to dump the values of my object and hash, but it keeps printing the keys out of order. How can I dump the keys in (recursive) sort-order? use Data::Dumper; print Dumper $obj;
qodeninja
  • 10,946
  • 30
  • 98
  • 152
6
votes
1 answer

Parse Wiktionary XML data dump into MySQL database using PHP

Alright, I'm just trying to parse Wiktionary Data Dump provided by Wikimedia. My intention is to parse that XML data dump into MySQL database. I didn't find proper documentation regarding the structure of this XML. Also, I'm not able to open the…
Jenson M John
  • 5,499
  • 5
  • 30
  • 46
6
votes
2 answers

What's the easiest way to convert an SO data dump from HTML back to Markdown?

I've just got my hands on a Stackoverflow data dump, and I'm disappointed to see that the Body field of the posts is in HTML rather than Markdown. I suspect there's Markdown in the original database because that's what I see if I try to edit an…
Norman Ramsey
  • 198,648
  • 61
  • 360
  • 533
4
votes
1 answer

Better way to dump huge file into MarkLogic DB

I am a new to MarkLogic and evaluating it to dump huge csv/text data with some transformation like filter based on some condition etc. As far as I know I can dump data using 2 ways. 1) Using java api for MarkLogic in a multi-threaded environment. 2)…
DMA
  • 1,033
  • 1
  • 11
  • 22
3
votes
1 answer

Downloading and using specific database from Freebase

I would like to download a part of the Freebase database (specifically the /film/film data dump). Currently I'm only seeing the option to download the entire data dump: https://developers.google.com/freebase/data Is there any option to download a…
David Tzoor
  • 987
  • 4
  • 16
  • 33
3
votes
1 answer

How can I increase XML data type in SQL Server 2008

I'm trying to import a large XML data-type in SQL Server 2008, but I'm getting the following error message: Msg 6365, Level 16, State 1, Line 3 An XML operation resulted an XML data type exceeding 2GB in size. Operation aborted. Does anyone know…
d-_-b
  • 21,536
  • 40
  • 150
  • 256
2
votes
0 answers

Very large write-intensive MySQL import

I have (what I would consider) a massive set of plain text files, around 400GB, that are being imported into a MySQL database (InnoDB engine). The .txt files range from 2GB to 26GB in size, and each file represents a table in the database. I was…
tom-g
  • 115
  • 2
  • 10
2
votes
1 answer

How to export primary keys on data-dump?

When I export my database with doctrine:data-dump, I encounter 2 problems: * the primary keys are not exported * instead of foreign keys columns correct name, it uses the name of the foreign table. For example, here are my tables: #…
Altefquatre
  • 125
  • 1
  • 2
  • 6
2
votes
2 answers

How to convert xml file of stack overflow dump to csv file

I have stack overflow data dump file in .xml format,nearly 27GB and I want to convert them in .csv file. Please somebody tell me, tools to convert xml to csv file or python program
Md Salim
  • 51
  • 6
2
votes
2 answers

Live private broadcasting online

I'd like to build a homemade land drone based in Raspberry Pi, a camera module, a WiFi module, some motors... I'd like it to be Internet-controlled, so I can use it remotely and do small tasks. It's a difficult project, but my only question is: is…
2
votes
2 answers

Unload from Redshift to S3 fails

I'm running the following command in Redshift: myDB=> unload ('select * from (select * from myTable limit 2147483647);') to 's3://myBucket/' credentials 'aws_access_key_id=***;aws_secret_access_key=***'; Here is what I get…
eistrati
  • 2,314
  • 6
  • 26
  • 35
2
votes
3 answers

Can I temporarily pause the RDF bulk loading process for a while and then recover it back to process?

I am currently bulk loading DBpedia and Freebase data dumps into the virtuoso open source edition v7 using the rdf_loader_run() procedure. The database is on our server and I have another graph IRI in Virtuoso running for an online service.The…
keepItSimple
  • 135
  • 5
2
votes
1 answer

Gettting Actor Ids and biographies from the data dumps or Freebase API

Does anyone know the best way of getting Actor Ids from Freebase data dumps, and later on getting the IMDB ids and biographies from the Freebase API?
Gidi
  • 181
  • 1
  • 10
2
votes
0 answers

User behaviors analysis, stackoverflow public data dump

I have a question - what would be the best way to figure out in which timezone particular user is situated based on the location field data? It seems like considerable amount of users have this field populated with some data, the form, however, is…
seninp
  • 712
  • 1
  • 6
  • 23
1
2 3 4