Questions tagged [metaphone]

Metaphone is a phonetic algorithm published in 1990 for indexing words by their English pronunciation.

Metaphone fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar. As with Soundex, similar sounding words should share the same keys. Metaphone is available as a built-in operator in a number of systems, including later versions of PHP.

48 questions
16
votes
2 answers

Enabling soundex/metaphone for non-English characters

I've been studying soundex, metaphone and other string search techniques the past few days, and in my understanding both algorithms work well in handling non-English words transliterated to English. However the requirement that I have would be for…
Jon Limjap
  • 94,284
  • 15
  • 101
  • 152
14
votes
2 answers

Algorithm using soundex() or metaphone() to create Mad Gab style phrases

I'm attempting to create an algorithm that will suggest Mad Gab style phrases. The input is a set of phrases. I also have a set of keywords that I'd like to use when possible. Currently, my solution is simply brute force: loop over phrases…
Jason McCreary
  • 71,546
  • 23
  • 135
  • 174
14
votes
1 answer

metaphone versus soundex versus NYSIIS

I'm trying to come up with an implicit spell checker that will use the mappings of input words to some kind of more general phonetic representation to account for typos that might occur, basically for a search bar that will automatically correct…
Slater Victoroff
  • 21,376
  • 21
  • 85
  • 144
11
votes
6 answers

What is the Metaphone 3 Algorithm?

I want to code the Metaphone 3 algorithm myself. Is there a description? I know the source code is available for sale but that is not what I am looking for.
necromancer
  • 23,916
  • 22
  • 68
  • 115
8
votes
4 answers

MetaPhone Functions (like SoundEx) functions and use in R?

I am wanting to use MetaPhone, Double Metaphone, Caverphone, MetaPhone3, SoundEx, and if anyone has done it yet NameX functions within 'R' so I can categorize and summarize like values to minimize data cleansing operations prior to analysis. I am…
CRSouser
  • 658
  • 9
  • 25
7
votes
1 answer

PHP metaphone implementation bug

I'm testing a metaphone implementation for C# and comparing its results against the built-in metaphone() function from PHP. However, I've come across a bug (which is previously documented in PHP's issue tracker and discussed on a mailing list), but…
Chris
  • 27,596
  • 25
  • 124
  • 225
5
votes
1 answer

How to decide which Encoder to use for which language in Elasticsearch "Phonetic Token filter"?

I have used Metaphone and soundex Encoder with "Phonetic Token Filter" in Elasticsearch. Metaphone is good for English words. Soundex is good for English as well as Hindi maybe many other languages as well. I want to know which of these encoders is…
Abhinav Keshri
  • 595
  • 5
  • 20
5
votes
2 answers

What are the practical usages of functions metaphone() and soundex() in PHP?

Metaphone and Soundex are phonetic algorithms for indexing strings by their English pronunciation. Have you ever used functions metaphone() or soundex() that are present in the standard PHP library? What for? What are real-life usages of these…
Tom Pažourek
  • 9,582
  • 8
  • 66
  • 107
5
votes
1 answer

Speech to Text on Android with custom unusual word matching

I would like to be able to use Android's Speech-To-Text engine to recognize a variety of unusual words in sentences. To give an example, the word "electroencephalograph" comes out of STT as "electronics supply graph". When I use Soundex or…
Buns of Aluminum
  • 2,439
  • 3
  • 26
  • 44
4
votes
3 answers

Undefined symbols for architecture x86_64: ... "_main", referenced from: implicit entry/start for main executable

Yak-shaving alert. Although I am precluded from displaying any source code, I figure with a well-written post I may be able to provide enough info to get assistance. The steps I have tried below have all been garnered from other posts, and it's…
Pyderman
  • 14,809
  • 13
  • 61
  • 106
4
votes
1 answer

Replace words using Soundex, python

i have a list of sentences and basically my aim is to replace all diff occurrences of prepositions in the form "opp,nr,off,abv,behnd" with their correct spellings "opposite,near,above,behind" and so on. The soundex code of the words are same so i…
Hypothetical Ninja
  • 3,920
  • 13
  • 49
  • 75
3
votes
1 answer

Localized (Double) Metaphone for Portuguese (pt_PT)

I want to see how phonetically similar two non-English strings are, AFAIK soundex and metaphone implementations only work correctly for English based strings, for instance coração and corassão sound exactly the same in Portuguese but metaphone()…
Alix Axel
  • 151,645
  • 95
  • 393
  • 500
3
votes
3 answers

Why isn't this MySQL double metaphone function working correctly?

I am just learning about the Metaphone and Double Metaphone search algorithms, and I have a few questions. Per the Metaphone Wiki page, I found a couple sources with implementations, a MySQL implementation in particular. I wanted to test it out…
user1077685
2
votes
2 answers

Metaphone 3 information

Does anyone know where code can be found for Metaphone 3 matching for T-SQL or at least something that describes in detail the difference between Double Metaphone and Metaphone 3? I have been working on evaluating different matching algorithms in…
I_AM_JARROD
  • 685
  • 2
  • 7
  • 20
2
votes
1 answer

Perfomance for join table with string comparison

I have two large tables and I need to fit them together. Matching should not be a clear comparison. I used trigrams, Levenshtein's formula, but I get very weak performance. Maybe someone can help improve performance. The size of table A is about 200…
1
2 3 4