Can I use cluster_center coordinates from a previous Kmeans fit as an init argument to sequentially update the cluster_center coordinates as new data arrives? Are there any drawbacks to this method?
UPDATED Online version of Scikit learns K-means:
KM = KMeans(n_clusters=3, random_state = 200, n_init = 1)
ni = 0
Until interrupted:
for x in data:
KM_updated = KM.fit(x)
Updated_centroids(i) = KM_updated.cluster_centers_(i) + 1/len(KM_updated.labels_(i) + 1) * (x - KM_updated.cluster_centers_(i))
KM = KMeans(n_clusters=3, random_state = 200, init = Updated_centroids(i), n_init = 1)