I'd like to run hierarchical clustering on a "large" matrix of dimensions 69878 x 10
but can't manage to because running hclust
in R requires first computing the pairwise distances and doing so crashes on these dimensions:
> str(x)
num [1:69878, 1:10] 0 0 0 0 0 0 0 9 1 0 ...
- attr(*, "dimnames")=List of 2
..$ : NULL
..$ : chr [1:10] "0.5" "1" "1.5" "2" ...
> d <- dist(x)
Error: cannot allocate vector of size 18.2 Gb
Is there a way to circumvent this limitation?