I have two data frames with columns of interest 'ParseCom', which is the left index of this fuzzy join, and 'REF' which should be a substring of 'ParseCom' during a join.
This is iterating over the Dataframe, which is not recommended. How can I implement a fuzzy join in Dask where I am joining on substrings?
for i, com in enumerate(defects['ParseCom']):
for j, sub in enumerate(repair_matrix['REF']):
if sub in com:
print(i,j, com)