2

So I have a dataset consisting 130000 points, in the format (x,y). My final goal is to cluster this data using kmeans. But for applying that, I need to find the optimum number of clusters to pass to the kmeans algorithm. How should I apply something like Gap Statistics or Levene's test in python to achieve this?

Siddharth Shah
  • 113
  • 4
  • 11

0 Answers0