I have two data sets var1 and var2. I want to compare the two distributions.
libary(ggplot2)
library(reshape)
set.seed(101)
var1 = rnorm(1000, 0.5)
var2 = rnorm(100000,0.5)
combine = melt(data.frame("var1" = var1,"var2"= var2))
ggplot(data = combine) + geom_density(aes(x = value, color = variable), alpha = 0.2)
This results in a density plot for the distribution which looks quite similar( except few wrigglings in median part), however, I want to show the tails of the distribution are not same. The spread is large for variable 2 than the one. Other than spread/quantiles which statistics could be compared to show the differences?
Does any one know any techniques, statistical test or visualization techniques especially to show the differences in the tails of the distribution(higher tails,higher extremes).