I write a Java code to cluster a huge dataset which has about 100000rowsx100000columns(sparse rows). But dataset is created with sparse instances, so it has the structure of a sparse matrix.
I can use 3 clustering functions in my code:
JavaML: Kmeans, Weka: SimpleKmeans, Weka: Xmeans
I have run Weka's SimpleKmeans function but it is working about 9 hours and clustering process in not over yet. What is the estimated running time of these functions and which one is the most suitable for this dataset best?