I want to perform Hierarchical Clustering in this dataset (107721 rows and 16 columns). In order to do this I have to calculate the distance matrix. When I use the dist function I get the error:
Error: cannot allocate vector of size 43.2 Gb
I have searched a lot and I haven't found a solution yet. Also, I saw that errors like this can be handled with bigmemory package. I read the documentation but I didn't understand how to use the functions of the package in order to solve this problem.
Does anyone know how to solve this problem?
Thanks in advance!