EDIT: This question is not a duplicate; reading the data alone is not the problem.
I want to do analysis on a CSV file in R that is around 10 GB. I am working on a GCE virtual machine with 60 GB of memory.
I would like to know which R package is suitable for reading such large files and performing operations like filter, group-by, column means (`colMeans`), etc. on them.
Which of the following would be the best choice (given that memory is not a constraint)?
- Stick with `read.csv` and packages like `dplyr` or the apply family (a minimal sketch of this option is included after the list).
- Use packages like `ff` or `bigmemory` for parallel processing.
- Use RSpark or any other distributed computing framework.
- Any other methodology that is well suited for this.
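
For context, this is the kind of pipeline I have in mind for the first option: a minimal sketch using `read.csv` plus `dplyr`, assuming a hypothetical file `data.csv` with a grouping column `group` and a numeric column `value` (these names are placeholders, not my actual schema).

```r
library(dplyr)

# With 60 GB of RAM, a ~10 GB file can in principle be read fully into memory.
df <- read.csv("data.csv", stringsAsFactors = FALSE)

# filter -> groupBy -> per-group mean
result <- df %>%
  filter(value > 0) %>%
  group_by(group) %>%
  summarise(mean_value = mean(value))

# Column means across all numeric columns
col_means <- colMeans(df[sapply(df, is.numeric)])
```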