I read the posts on the topic like how to cluster docs based on their similarity here. But I still can not understand how it realizes it. My test is that I have the cos similarity measures of 10 docs. Below are some:
D1 D2 sim(D1,D2)
d1 d10 0.6823
d1 d2 0.6377
d1 d8 0.0307
d1 d9 0.0294
d1 d7 0.0284
d1 d3 0.0234
d1 d4 0.0199
d1 d6 0.0110
d1 d5 0.0030
d10 d2 0.7232
d10 d3 0.3898
d10 d4 0.3054
d10 d9 0.0256
d10 d7 0.0227
d10 d8 0.0226
d10 d6 0.0110
d10 d5 0.0060
d2 d3 0.7850
...
...
Can I cluster these docs solely based on the similarity measures? If I specify the number of clusters, how to do it? If I do not specify the number of clusters, can the algorithm automatically cluster those docs, how to do it? Thanks in advance.