i have a dataframe which consists of 472 rows and 32 columns and it looks like this:
2 3 0 4 2 0 0 5 2 3 3 3 2 0 5 5 3 3 3 2 2 0 2 5 3 3 3 2 2 2 0 5
2 3 0 4 2 0 0 5 2 3 3 3 2 0 5 5 3 3 3 2 2 0 2 5 3 3 3 2 2 2 0 5
2 3 0 4 2 0 0 5 2 3 3 3 2 0 5 5 3 3 3 2 2 0 2 5 3 3 3 2 2 2 0 5
here, every row represent 32 teeth of a person and each number between 0-5 represent different teeth categories. now i want to measure the distance between any 2 rows by using different distance metrics (eg MANHATTAN, EUCLID, MINKOWSKI). so, the less the difference the more likely they are the same people etc.
*if i apply ONE-HOT-ENCODING before computing these metrics, there will be more than 32 columns for every row, which will be useless for me.
*i also found cdist and pdist, but these functions give me element-wise distance results. but what i want is to obtain a "single result" between any two rows.
am i trying something non-sense or what should i do to be able to compute these distances ?