I am new to R, and I've been stuck on this problem for quite some time. Whenever I try to build deciles or quartiles and summarize them with the 'dplyr' package, my deciles collapse into fewer groups. I want 10 distinct groups, but I only get 6, 4, or sometimes just 3. As far as I can tell, R merges small deciles when there is too little data, but I want to avoid this. Please help! Thanks!!
The code is:
mydata <- data.frame(col1= c(0,00,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,5,3,12,5,65,23,65984,21,5469,321,6,100,200,300,400,500,600,700,800,900,1000,1100,1200,1300,1400,1500,5233,18000))
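# Decile cut points at the 10th, 20th, ..., 90th percentiles
DecLocations <- quantile(mydata$col1, probs = c(0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9))
# Assign each value to the interval between consecutive cut points
mydata$decile <- findInterval(mydata$col1, c(-Inf, DecLocations, Inf))
library(dplyr)
mydata$decile <- factor(mydata$decile)
# Sum col1 within each decile and list the highest decile first
decile_grp <- group_by(mydata, decile)
decile_summ_test <- summarize(decile_grp, total_cnt = sum(col1))
decile_summ_test <- arrange(decile_summ_test, desc(decile))
View(decile_summ_test)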
Here I only get 6 deciles, because R seems to merge the small ones. This is what I'm trying to avoid: I expect to get all 10 deciles, even if some of them contain really small numbers.
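
A minimal sketch of what appears to be happening, and one possible workaround. It assumes the collapse comes from tied values rather than from any merging step: with this many zeros, several of the cut points returned by quantile() are identical, so findInterval() has nothing to put between them. The workaround uses dplyr's ntile(), which buckets rows by rank instead of by value. The names breaks, by_value and decile_summ are made up for this illustration.

```r
library(dplyr)

# Same data as in the question
mydata <- data.frame(col1= c(0,00,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,5,3,12,5,65,23,65984,21,5469,321,6,100,200,300,400,500,600,700,800,900,1000,1100,1200,1300,1400,1500,5233,18000))

# Decile cut points: the lowest ones coincide because of the tied zeros
breaks <- quantile(mydata$col1, probs = seq(0.1, 0.9, by = 0.1))
print(breaks)

# With repeated cut points, the intervals between them stay empty,
# so fewer than 10 distinct groups show up
by_value <- findInterval(mydata$col1, c(-Inf, breaks, Inf))
print(table(by_value))

# ntile() ranks the rows and splits the ranks into 10 buckets,
# so every bucket receives rows even when values are tied
mydata$decile <- ntile(mydata$col1, 10)

decile_summ <- mydata %>%
  group_by(decile) %>%
  summarize(n_rows = n(), total_cnt = sum(col1)) %>%
  arrange(desc(decile))
print(decile_summ)
```

One trade-off to be aware of: because ntile() breaks ties by row order, rows with the same value (here, the zeros) end up spread across several deciles. Whether that is acceptable depends on what the deciles are used for.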