I have a data frame with 10017 observations divided in 159 financial institutions. How can I improve the normality of the distributions of each financial institution without having to go to excel and manually changing data beyond +-3SD with the values on the 1% and 99% of the distribution?
I'm new to data analysis hence I hope it is clear
I asked for tapply(df$x, df$id, quantile, (0.01,0.99))
and then I changed the outliers on Excel