I've been digging around about how to properly prepare data for clustering, and I came across this tutorial that explains you can't just randomly normalize each column, because normalizing a power law distribution will not yield a correct transformation (and you should use a log transform in that case).
I'm trying to transform a dataframe with 200+ columns (after preparing and removing mostly empty and autocorrelated columns). So my question is, is there a way to automatically check the distribution of each feature and then make the most fitting transformation (normalization for Gaussian distro, log transform for power law distro, using quantiles for "unrecognizable" distros etc.) automatically? Or is this something I have to do by hand for all those columns? Thank you!