I use a a dictionary to represent word count in a article
For example {"name" : 2 , "your": 10, "me", 20}
to represent that "name" appears twice, "your" appears 10 times and "me" appears 20 times.
So, is there a good way to calculate the euclidean distance of these vectors? The difficulty is that these vectors are in different lengths and some vectors contains certain words while some do not.
I know I sure can write a long function to do so, just look for a simpler and cleverer way. Thanks
Edit: The objective is to get the similarity between two article and group them