I have a large number of Unicode strings that I want to store in a MySQL database. I also want to add an extra field that represents the character-level identity of each string. For example:
String                  Key
--------------------    ---------
this is 1st string      113547858
this is first string    113547865
I go to school          524872354
As you may have noticed above, the first two keys are numerically close to each other, reflecting the similarity of their strings, whereas the third key is far from both.
I don't want to use PHP's similar_text() or levenshtein(), because they need two strings to compare. Instead, I want to compute one value per string and store it in the database, so that I can put an index on that column for later use.
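To make the idea concrete, here is a rough sketch of the kind of per-string value I have in mind. It is written in Python only for brevity, and it assumes a SimHash-style fingerprint over character trigrams, which may or may not be the right technique. The function names (trigrams, simhash, hamming) are made up for illustration. Note one difference from my table above: with this kind of fingerprint, similar strings should differ in only a few bits, so closeness would be measured by Hamming distance rather than by numeric difference.

```python
import hashlib

BITS = 64  # fingerprint width; would fit a MySQL BIGINT UNSIGNED column


def trigrams(text):
    """Return the set of character trigrams of a Unicode string."""
    # Lowercase and collapse whitespace so trivial differences are ignored.
    text = " ".join(text.lower().split())
    if len(text) < 3:
        return {text}
    return {text[i:i + 3] for i in range(len(text) - 2)}


def simhash(text):
    """Compute a 64-bit SimHash-style fingerprint for one string."""
    counts = [0] * BITS
    for gram in trigrams(text):
        # Hash each trigram to 64 bits (first 8 bytes of its MD5 digest).
        digest = hashlib.md5(gram.encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        # Each trigram votes +1 or -1 on every bit position.
        for i in range(BITS):
            counts[i] += 1 if (h >> i) & 1 else -1
    # A bit is set in the fingerprint when its votes are positive.
    key = 0
    for i in range(BITS):
        if counts[i] > 0:
            key |= 1 << i
    return key


def hamming(a, b):
    """Number of bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")


if __name__ == "__main__":
    for s in ("this is 1st string", "this is first string", "I go to school"):
        print(s, simhash(s))
```

With something like this, I would expect hamming(simhash("this is 1st string"), simhash("this is first string")) to come out smaller than the distance from either of them to "I go to school", but I have not verified that, and I am not sure how well a plain index on the key column would serve such bitwise comparisons.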