I have a data.frame with 10 cols and about 700K rows.
I want to use the pairs(data.frame) function to show a pairwise scatterplot of the column values. It is not necessary (or feasible) to plot all 700K rows in each plot so I'd like to select a random subset of say 2 or 3K (some small number) of rows to be plotted.
Can someone please assist with my options to select a small random subset of my data frame. I think that either
- a random subset of X% of the data.frame or
- every Nth row would work.
I know I've seen this done but can't locate the code snippet ....
thanks