I have a df as follows:
0 111155555511111116666611111111
1 555555111111111116666611222222
2 221111114444411111111777777777
3 111111116666666661111111111111
.......
1000 114444111111111111555555111111
I am calculating the distance between each string. For instance, to get the distance between the first 2 strings: textdistance.hamming(df[0], df[1])
. This will return a single integer.
Now, I want to create a df that stores all the distance between each string. In this case, since I have 1000 strings, I will have a 1000 by 1000 df. The first value is distance between string 1 and itself, then string 1 and string2 and so on. Then in next row its string 2 and string1, string 2 and itself and so on.