I have a dataset with 100,000 records and need to compute the Euclidean distance matrix for it, which would be a 100,000 x 100,000 matrix. In Python (SciPy) we have squareform(pdist(x)), but I cannot apply that function to an RDD. How can I do this on the Spark platform in Python?
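To make the goal concrete, here is a minimal sketch of the kind of computation I am after, assuming the points are plain tuples of floats. The names `euclidean`, `to_entry`, `local_distance_entries` and `spark_distance_entries` are hypothetical helpers I made up for illustration. The Spark version is untested at scale and only assumes that `sc` is an existing SparkContext; it relies on `RDD.zipWithIndex` and `RDD.cartesian`.

```python
import math
from itertools import product


def euclidean(a, b):
    """Euclidean distance between two equal-length numeric sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def to_entry(pair):
    """Turn ((point_i, i), (point_j, j)) into a sparse entry (i, j, distance)."""
    (p, i), (q, j) = pair
    return (i, j, euclidean(p, q))


def local_distance_entries(points):
    """Pure-Python reference: all (i, j, distance) entries for a small list."""
    # (point, index) pairs, the same shape RDD.zipWithIndex produces
    indexed = [(p, i) for i, p in enumerate(points)]
    return [to_entry(pair) for pair in product(indexed, indexed)]


def spark_distance_entries(sc, points):
    """Hypothetical Spark counterpart; `sc` is assumed to be a SparkContext.

    Returns an RDD of (i, j, distance) entries instead of one dense matrix.
    """
    indexed = sc.parallelize(points).zipWithIndex()
    return indexed.cartesian(indexed).map(to_entry)


sample = [(0.0, 0.0), (3.0, 4.0), (6.0, 8.0)]
entries = local_distance_entries(sample)
```

Keeping the result as (i, j, distance) entries rather than a dense array seems necessary here: 100,000 x 100,000 values at 8 bytes each is about 80 GB, and the cartesian product itself produces 10^10 pairs, so this is only a starting point.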
Munindra Naidu