I am trying to merge two datasets on a string variable, place, whose values are similar but in many cases not exactly the same.
For example, one dataset may contain "London, UK" while the other contains just "London". I would like to have a matching score next to each record and merge the records only when the similarity score is greater than 0.80.
Is there a way to merge two datasets, A and B, on the variable place?
The first dataset, A, has four variables: name, place, origin, ranking
The second dataset, B, has four variables: ratings, place, destinations, trip time
I want to merge dataset A and dataset B.
The final merged dataset should have the following variables:
name, place, origin, ranking, ratings, destinations, trip time
I want to merge based on dataset A (like a left join in SQL). I want all records of A with added columns from B.
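For illustration, here is a minimal sketch of the kind of fuzzy left join described above. The question does not name a language, so this uses Python with only the standard library (`difflib.SequenceMatcher`). The helper names `similarity` and `fuzzy_left_join`, the `match_score` column, and the sample rows are all made up for the example; the similarity metric is one possible choice, not necessarily the one that suits this data best.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Return a 0..1 similarity score, ignoring case and surrounding spaces."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def fuzzy_left_join(a_rows, b_rows, key="place", threshold=0.80):
    """Keep every row of A; attach the best-matching B row if score >= threshold."""
    # Columns contributed by B (everything except the join key).
    b_columns = [c for row in b_rows[:1] for c in row if c != key]
    merged = []
    for a in a_rows:
        # Find the B row whose key is most similar to this A row's key.
        best_row, best_score = None, 0.0
        for b in b_rows:
            score = similarity(a[key], b[key])
            if score > best_score:
                best_row, best_score = b, score
        out = dict(a)
        out["match_score"] = round(best_score, 2)
        matched = best_row is not None and best_score >= threshold
        # Unmatched A rows are kept, with B's columns left empty (left join).
        for col in b_columns:
            out[col] = best_row[col] if matched else None
        merged.append(out)
    return merged


# Hypothetical sample data, shaped like datasets A and B in the question.
A = [
    {"name": "Ann", "place": "London", "origin": "Leeds", "ranking": 1},
    {"name": "Bob", "place": "New York City", "origin": "Boston", "ranking": 2},
    {"name": "Cy", "place": "Paris", "origin": "Lyon", "ranking": 3},
]
B = [
    {"ratings": 4.5, "place": "London, UK", "destinations": 12, "trip time": 3},
    {"ratings": 4.0, "place": "New York City, NY", "destinations": 20, "trip time": 8},
]

result = fuzzy_left_join(A, B)
for row in result:
    print(row)
```

With this particular metric, "London" against "London, UK" scores 0.75, which falls just below the 0.80 cutoff, so that row keeps its A columns but gets empty B columns. "New York City" against "New York City, NY" scores 0.87 and is merged. If pairs like the London one should match, the threshold could be lowered, or the place strings could be normalized before scoring (for example, comparing only the text before the first comma).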