I am trying to merge fuzzy two tables but they are really big and need too much time. Could you help and tell how I can parallelize these codes?
Many thanks!
for i in list1:
mat1.append(process.extract(i, list2, limit=2))
SE['MergeName'] = mat1
SE
for j in SE['MergeName']:
for k in j:
if k[1] >= threshold:
p.append(k[0])
mat2.append(",".join(p))
p = []
# storing the resultant matches
# back to dframe1
SE['MergeName'] = mat2
SE