This is a fairly vague question, but is there a function or package in R that identifies and merges similar or identical observations across two or more columns, ideally also rating how similar each pair of observations is? I have two messy data sets that share some identifier columns, but the values differ between the two through spelling variations and other small differences. For example, take two columns like these:
c1 <- c("ELIZA 2A", "aaab", "Unique New York", "I slith the Sheeth", "fdasa", "Yellow leather")
c2 <- c("ELIZA", "fjdkaldjlk", "Unique NY", "Slith Sheeth", "Y. Leather")
In this case, the 1st, 3rd, 4th, and 6th elements of c1 are similar to the 1st, 3rd, 4th, and 5th elements of c2, respectively. I would like a function or algorithm that finds those pairs, ideally shows how similar they are, and then merges the data using either c1 or c2 as the key. The real data sets have over 15,000 observations with even messier rows; this is just a small example. I hope that makes sense.
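To make the goal concrete, here is a rough sketch of the kind of output I have in mind, using base R's adist() (edit distance, from the utils package). The scaling by the longer string's length and the 0.5 cutoff are arbitrary assumptions of mine for illustration, not a recommended method, and I have no idea whether this approach holds up on 15,000 rows.

```r
c1 <- c("ELIZA 2A", "aaab", "Unique New York", "I slith the Sheeth", "fdasa", "Yellow leather")
c2 <- c("ELIZA", "fjdkaldjlk", "Unique NY", "Slith Sheeth", "Y. Leather")

# Pairwise edit distances, ignoring case (rows = c1, columns = c2)
d <- adist(c1, c2, ignore.case = TRUE)

# Scale each distance by the longer string's length, so that
# 1 means identical and 0 means nothing in common (my own assumption)
max_len <- outer(nchar(c1), nchar(c2), pmax)
sim <- 1 - d / max_len

# For every element of c1, pick the most similar element of c2
best <- apply(sim, 1, which.max)
matches <- data.frame(
  c1 = c1,
  c2 = c2[best],
  similarity = round(sim[cbind(seq_along(c1), best)], 2),
  stringsAsFactors = FALSE
)

# Treat weak matches as "no match"; the 0.5 cutoff is an arbitrary guess
threshold <- 0.5
matches$c2[matches$similarity < threshold] <- NA
print(matches)
```

Ideally the result would be a table like matches, which I could then use to join the two data sets with merge().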
Thank you for your help!