I have a dictionary that looks like this:
my_dict = {'Community A': ['User 1', 'User 2', 'User 3'],
'Community B': ['User 1', 'User 2'],
'Community C': ['User 3', 'User 4', 'User 5'],
'Community D': ['User 1', 'User 3', 'User 4', 'User 5']}
My goal is to model the networked relations between the different communities and their sets of unique users to see which communities are most similar. Currently, I am am exploring using Jaccard similarity.
I have come across answers that do similar operations, but only on exactly 2 dictionaries; in my case, I have several, and will need to calculate the similarities between each set.
Also, some of the lists are of different lengths: in other answers, I saw 0
sub in as a missing value in that case, which I think will work in my case.