I often work with big CSV files (>50GB) with >10,000 columns. I'm thinking of reading the information into RSQLite
so that I can do easy querying and subsetting.
ISSUE
SQLite has a limit of 2000 fields and 999 host parameters in a single SQL (see SQLite website). According to this, there is a way to change these parameters for SQLite (i.e. by re-compiling SQLite from source).
QUESTION
How do I make the above changes if I'm using RSQLite
package? The funny thing is that according to the package's changelog, the default values have already been increased to 30,000 and 40,000 respectively as early as ver 0.11.0 back in 2011-12-01. But when I test with dataframes with 10,000 columns, RSQLite v2.1.1
still threw an error.
What am I missing here? If I need to re-compile SQLite to change the values, how do I get RSQLite
to use that new version?
Thanks in advance!