I am trying to import a dataset of 217,000 records (Jeopardy Dataset) into MonetDB through the MonetDB.R interface.
The file is a CSV file with top two lines as folows:
show_nos, air_dt, rnd, category, prize, ques, ans,x1,x2,x3
4680,12/31/2004,Jeopardy!,THE COMPANY LINE,$200 ,"In 1963, live on ""The Art Linkletter Show"", this company served its billionth burger",McDonald's,,,
4680,12/31/2004,Jeopardy!,EPITAPHS & TRIBUTES,$200 ,"Signer of the Dec. of Indep., framer of the Constitution of Mass., second President of the United States",John Adams,,,
The problem I face is while importing the ques
column (data between " "). That column has multiple commas and punctuations, and monet.read.csv is unable to import that column.
I tried importing a few records without the ques
column, and it works perfectly.
Can you please suggest on how to import such columns with free flow text in monetdb? Once imported I intend to perform some text analysis on the column.