I have 2 large .txt data frames I'm working with in R (greater than 5GB each with more than 5 million observations) and I was wondering if there was an easy way to randomly sample ~ 20,000 rows from each when reading the data in.
Presently, I cannot read the data in and then sample from there, as the data is too big and I run into an error.