I have used the Metaphone and Soundex encoders with the "Phonetic Token Filter" in Elasticsearch.
Metaphone is good for English words.
Soundex is good for English as well as Hindi, and maybe for many other languages too.
I want to know which of these encoders works best for Hindi and, if possible, other Indian languages?
- soundex
- metaphone
- double_metaphone
- refined_soundex
- caverphone1 - English (New Zealand localised)
- caverphone2 - English (New Zealand localised)
- cologne - German
- nysiis - improved Soundex
- koelnerphonetik - German
- haasephonetik - German
- beider_morse - English and multiple European languages
- daitch_mokotoff - Slavic and Yiddish surnames
I am asking because the Elasticsearch website does not say which encoder to choose for which language.
Also, please tell me which of these encoders you have already used, and for which languages.
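
For context, here is a minimal sketch of the kind of index settings I mean when I switch between encoders. It assumes the analysis-phonetic plugin is installed. The filter and analyzer names (`my_phonetic`, `phonetic_analyzer`) and the helper `phonetic_settings` are made up for illustration, not taken from any official example.

```python
import json


def phonetic_settings(encoder: str) -> dict:
    """Build index settings with a phonetic token filter for the given encoder.

    The names "my_phonetic" and "phonetic_analyzer" are hypothetical.
    """
    return {
        "settings": {
            "analysis": {
                "filter": {
                    "my_phonetic": {
                        "type": "phonetic",   # filter type provided by the phonetic plugin
                        "encoder": encoder,   # e.g. "soundex", "metaphone", "double_metaphone"
                        "replace": False,     # keep the original token next to the phonetic code
                    }
                },
                "analyzer": {
                    "phonetic_analyzer": {
                        "type": "custom",
                        "tokenizer": "standard",
                        "filter": ["lowercase", "my_phonetic"],
                    }
                },
            }
        }
    }


if __name__ == "__main__":
    # Print the request body for each encoder I have tried so far.
    for encoder in ("soundex", "metaphone"):
        print(json.dumps(phonetic_settings(encoder), indent=2))
```

The printed JSON would go in the body of the index-creation request, so comparing encoders only means changing the `encoder` value and reindexing the same Hindi test data.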