I want to visualize the statistics of commits in our project. I would like to classify them into several groups based on some metrics, like commit messages or the number of changed lines of code.
Is there any systematic approach I can use? I have trouble with the selection of right keywords for commit message classification and with the selection of proper thresholds for size classification.