I am looking for a toolkit/library that will allow me to extract phonemes from audio files. I see on the CMU Sphinx website that sphinx3 can be used to output phonemes, however I haven't been able to successfully build it on my windows box.
Apart from the library, what is the best way to go about building your own?