Questions tagged [phoneme]

A phoneme in linguistics is the smallest unit of sound that distinguishes one word from another in a language. The word "the", for example, consists of the phonemes /ð/ and /ə/. Use this tag for questions about identifying phonemes or handling them in natural language processing applications.

For instance, in the Speech Synthesis Markup Language (SSML), which speech synthesisers use when reading text, the phoneme element specifies the pronunciation of the text it contains:

<ssml:phoneme alphabet="x-microsoft-ups" ph="string"> </ssml:phoneme>
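As a rough illustration of the element above, the following Python sketch builds a phoneme element with the standard library. The helper name `phoneme_element` is hypothetical, and the `ipa` alphabet value and the unprefixed `phoneme` tag follow the W3C SSML specification rather than the Microsoft-specific form shown above, so treat both as assumptions.

```python
import xml.etree.ElementTree as ET


def phoneme_element(text, ph, alphabet="ipa"):
    """Return an SSML <phoneme> element wrapping `text`.

    `phoneme_element` is a hypothetical helper for illustration only.
    `ph` holds the phonemic transcription; `alphabet` names the
    phonetic alphabet it is written in ("ipa" is assumed here).
    """
    elem = ET.Element("phoneme", {"alphabet": alphabet, "ph": ph})
    elem.text = text
    return elem


def to_markup(elem):
    """Serialise an element to a Unicode string, keeping IPA characters as-is."""
    return ET.tostring(elem, encoding="unicode")


if __name__ == "__main__":
    # The word "the" transcribed as the phonemes /ð/ and /ə/.
    print(to_markup(phoneme_element("the", "ðə")))
```

A synthesiser that accepts SSML would normally expect this element inside a full document, which is not shown here.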

Here is what Wikipedia has on phonemes.

53 questions
18
votes
7 answers

API to break voice into phonemes / synthesize new speech given speech samples?

You know those movies where the tech geeks record someone's voice, and their software breaks it into phonemes? Which they can then use to type in any phrase, and make it seem as if the target is saying it? Does that software exist in an API…
AShelly
  • 34,686
  • 15
  • 91
  • 152
18
votes
4 answers

iOS / C: Algorithm to detect phonemes

I am searching for an algorithm to determine whether realtime audio input matches one of 144 given (and comfortably distinct) phoneme-pairs. Preferably the lowest level that does the job. I'm developing radical / experimental musical training…
P i
  • 29,020
  • 36
  • 159
  • 267
18
votes
3 answers

convert sound to list of phonemes in python

How do I convert any sound signal to a list of phonemes? I.e. the actual methodology and/or code to go from a digital signal to a list of phonemes that the sound recording is made from. E.g.: lPhonemes = audio_to_phonemes(aSignal) where for…
Roman
  • 8,826
  • 10
  • 63
  • 103
11
votes
2 answers

Estimate Phonemic Similarity Between Two Words

I am working on detecting rhymes in Python using the Carnegie Mellon University dictionary of pronunciation, and would like to know: How can I estimate the phonemic similarity between two words? In other words, is there an algorithm that can…
duhaime
  • 25,611
  • 17
  • 169
  • 224
11
votes
10 answers

Does a pronounceable encoding exist?

I am using UUIDs, but they are not particularly nice to read, write and communicate. So I would like to encode them. I could use base64, or base32, but they would not be easy anyway: base64 has capitalized letters and symbols. Base32 is a bit…
Stefano Borini
  • 138,652
  • 96
  • 297
  • 431
7
votes
4 answers

Rhyme Dictionary from CMU pronunciation database

I'm looking for a free or open source rhyming database. I've found the CMU pronunciation "database" and its series of apps but I can't make sense of them or figure out where the data's coming from. A simple text file with the word and its phonemes…
Kevin
  • 388
  • 7
  • 23
6
votes
4 answers

Speech to Phoneme in .Net

The problem is that I want to get the phonemes of an audio speech recording in C#. Say you have an audio file like "x.wav" that says "hello dear Shamim". I want to extract all the phonemes of the speech and their relative timings. Something like the…
Shamim
  • 434
  • 4
  • 11
4
votes
2 answers

Extracting phonemes from audio (speech) files

I am looking for a toolkit/library that will allow me to extract phonemes from audio files. I see on the CMU Sphinx website that sphinx3 can be used to output phonemes, however I haven't been able to successfully build it on my windows box. Apart…
sandesh247
  • 1,658
  • 1
  • 18
  • 24
4
votes
3 answers

Detect similar sounding words in Ruby

I'm aware of SOUNDEX and (double) Metaphone, but these don't let me test for the similarity of words as a whole - for example "Hi" sounds very similar to "Bye", but both of these methods will mark them as completely different. Are there any…
JP.
  • 5,507
  • 15
  • 59
  • 100
3
votes
3 answers

getX() has private access in java.awt.Rectangle?

Why does the following code generate the error message: getX() has private access in java.awt.Rectangle (int)dest.getX(), (int)dest.getY(), (int)dest.getWidth(), (int)dest.getHeight() According to the doc, Rectangle does have a public…
pierrotlefou
  • 39,805
  • 37
  • 135
  • 175
3
votes
1 answer

Convert word to phonetic pronunciation with Sphinx4

I am trying to achieve English words to Phoneme word translation to create a language model for CMUSphinx. Now I am using the following tool to achieve the word translation link For example if I give a text file as input with the following word it …
luckysing_noobster
  • 1,933
  • 5
  • 24
  • 50
3
votes
2 answers

Are there any open-source phoneme sets (for speech synthesis)?

I am trying to make a super basic speech synthesizer, and I need some form of phoneme audio files so that I can piece them together and build words. Are there any open phoneme sets that I would be able to use for this?
SilentLupin
  • 658
  • 10
  • 16
3
votes
1 answer

Using SAPI 5.1 under Win7 64

I'm writing an automated lip syncing tool that is based on the Annosoft Console Program (Link to the Annosoft Tool). This program uses SAPI 5.1. For me it works great on Win XP 32 Bit. But is there any way to get this to work under a Win7 64 Bit…
KlausHerbert
  • 31
  • 1
  • 3
3
votes
2 answers

CMU Sphinx4 phoneme dictation

How can I configure sphinx4 to be able to detect only phonemes in a dictation? I've already read about partial results "You can control how often the result listener is fired by setting the configuration variable 'featureBlockSize' in the…
Marl
  • 1,492
  • 2
  • 22
  • 37
2
votes
1 answer

voice recognition splitting words to phoneme level

I am thinking of developing voice recognition software for my native language and I am thinking of using CMUSphinx-4 for that. There is a CMU dictionary file which contains English words that maps with split of the original word to its phoneme…
jan
  • 53
  • 6