I'm trying to create a forst for nearest neighbor searching but I'm not sure I'm doing it right or even if MinHash / LSH is an appropriate choice for my data. I ask this because the results are not usable.
I'm trying to follow the example in the documentation.
My Data:
512 dimensions, eg value is a bit eg 0 or 1 Is this actually usable for MinHash / LSH? And if yes, how would I construct the MinHash for each record?
As far as i understood the point of minhash is already to map the data to such a bit-structure? So i could just load the bits into it? As in h = MinHash(num_perm=512, hashvalues=listOfBits)
?