Im working on stackoverflow data dump .csv file and I need to to find the distribution of scores for questions.
I opened the file in R and extracted the two columns that I need which are the PostTypeID and Score.
example :
I need to find :
3 rows in the score column that has the score 11.
2 rows in the score column that has the score 3. .... etc
The thing is the data is too large, it has 3 million rows and I don't know how to get the distribution.
Note I'm a beginner in R, so I need the simplest way to do that.