Just doing some general research. Are there any open source (or even paid?) tools / programs that do the following:
INPUT: an audio file of some unlabeled speech, maybe a few sentences long, (no indication as to what the phonetic transcriptions are in the audio)
OUTPUT: an audio file with phonetic transcriptions (in the IPA alphebet) aligned and labeled on the audio
Is this possible to be done with just a phonetic dictionary and without a word dictionary?