I am struggling with the basics. I have just one column with names in pandas dataframe and I want to compare strings for potential duplicates using 3-4 functions from fuzzywuzzy library. So first name I want to check against the rest of the column content, then 2nd name and so on. Column will have hundreds if not thousands of names. I want to create a df with combination of names for which at least one of the values is above 80.
Do I need to create a list out of that df? Apologies, I know it is very basic I just can't seem to find a solution myself.