I am looking to complement my seaborn boxplots with a five-number summary using pandas's df.describe().
I have set my boxplot to ignore outliers. However, I am not sure whether df.describe() ignores outliers by default, or whether I need to remove them from my DataFrame before running df.describe().
For example, I would compute the z-score for each row of data and then drop all rows with a z-score higher than 3. But if pandas already does that, would I be doing the same process twice?
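To make the filtering step concrete, here is a minimal sketch of it. The data and the column name `value` are made up for illustration, and the cutoff is applied to the absolute z-score (an assumption, so that extreme low values are dropped as well as extreme high ones). It runs describe() on both the raw and the filtered data so the two summaries can be compared side by side.

```python
import numpy as np
import pandas as pd

# Made-up data: 200 ordinary values plus two extreme ones (hypothetical column "value").
rng = np.random.default_rng(0)
df = pd.DataFrame({"value": np.append(rng.normal(50, 5, 200), [500.0, 650.0])})

# describe() on the raw frame uses every non-null value, extremes included.
raw_summary = df["value"].describe()

# z-score of each row, based on the column's own mean and standard deviation.
z = (df["value"] - df["value"].mean()) / df["value"].std()

# Keep only rows whose absolute z-score is at most 3 (assumed cutoff).
filtered = df[z.abs() <= 3]
filtered_summary = filtered["value"].describe()

# Side-by-side view of the two summaries.
print(pd.concat({"raw": raw_summary, "filtered": filtered_summary}, axis=1))
```

If the "raw" and "filtered" columns differ in count, max, mean and std, then describe() did not drop the extreme rows on its own and the filtering is not being done twice.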
I compared my boxplot to the output of df.describe() and I honestly can't make out the difference with the naked eye.
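Rather than comparing by eye, the numbers can be checked directly. The sketch below assumes the plot uses the common rule of whiskers at 1.5 × IQR and that quartiles are linearly interpolated percentiles; it recomputes those values with NumPy on made-up data and prints them next to the describe() output. The variable names are hypothetical.

```python
import numpy as np
import pandas as pd

# Made-up data with one extreme value.
rng = np.random.default_rng(1)
s = pd.Series(np.append(rng.normal(50, 5, 200), [500.0]), name="value")

summary = s.describe()

# Box edges and median as linearly interpolated percentiles.
q1, med, q3 = np.percentile(s, [25, 50, 75])
iqr = q3 - q1

# Whiskers reach the most extreme points within 1.5 * IQR of the box (assumed rule).
whisker_low = s[s >= q1 - 1.5 * iqr].min()
whisker_high = s[s <= q3 + 1.5 * iqr].max()

print("box quartiles     :", q1, med, q3)
print("describe quartiles:", summary["25%"], summary["50%"], summary["75%"])
print("whiskers          :", whisker_low, whisker_high)
print("describe min/max  :", summary["min"], summary["max"])
```

Under these assumptions the quartiles agree, which would explain why the box looks the same as the describe() output, while min and max can differ from the whisker ends whenever points fall outside the whiskers.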