There are two id descriptions which are similar that is have say 80% similarity. I need both to be given same id.
There are other id descriptions having say 60% similarity. These should retain their own ids. Once an id desc has been considered and modified, it should not be taken as a reference. Further example:
id id description
1 pepsodent
2 pepsodent salt
3 pepsod
4 pepsodent and salt
5 peps
Now, pepsodent matches with pepsodent salt.therefor both should be given id as 1 Now as pepsodent salt has already been modified,it cannot be used as a scale of reference further.