0

I am trying to merge fuzzy two tables but they are really big and need too much time. Could you help and tell how I can parallelize these codes?

Many thanks!

for i in list1:
    mat1.append(process.extract(i, list2, limit=2))
SE['MergeName'] = mat1 
SE
for j in SE['MergeName']:
    for k in j:
        
        if k[1] >= threshold:
            p.append(k[0])
              
    mat2.append(",".join(p))
    p = []
      
# storing the resultant matches 
# back to dframe1
SE['MergeName'] = mat2
SE

0 Answers0