I am trying to visualize my high dimensional data set in two axis or components using nonmetric multi-dimensional scaling. This function is available in scikit-learn library. Here is my code:
from sklearn.manifold import MDS
embedding = MDS (n_components=2, metric= False, n_init=2, max_iter=100,
verbose=0, eps=0.001, n_jobs=2, random_state=101
,dissimilarity='euclidean')
#precip=precip[0:100]
precip_transformed = embedding.fit_transform(precip)
precip_transformed
The default values for n_init is 4 and max_iter is 300 and n_jobs=None (which means -1). This takes forever to run even though I reduced the default values and increased the n_jobs. It also makes my notebook crash after a while. I should mention that my data has 20000 rows and when I keep the commented out line of the code (only 100 rows), it works. Does anyone know how I can make this work? faster or some way to make sure the notebook won't crash.