Given a dataset with a non-uniform distribution (highly peaked) I want to resample to create a new dataset with an approximately uniform distribution. My approach:
- Divide the data into bins.
- Target bin level = Smallest number of samples per bin, among all bins.
- Randomly delete samples until each bin count = target bin level.
Is there a better technique?