I have a dataset of
customer | income(k) | spend (k)
value value value
The data set has 40 entries.
I am using the elbow method to try and figure out how many clusters to use.
My question is how do I determine what number to use with set.seed()
?
code is below:
set.seed(?)
wcss = vector()
for (i in 1:10) wcss[i] = sum(kmeans(new_dataset, i)$withinss)
plot(1:10,
wcss,
type = 'b',
main = paste('The Elbow Method'),
xlab = 'Number of clusters',
ylab = 'WCSS')