I'm doing and application that do the fallowing:
1:
If some noise is detected by the microphone, its starts to record audio, until no noise is detected. After it, the audio is recorded to a wav file.
2:
I have to detect some words on it. There are only, 5 to 10 words to detect.
So far, my code only does the first part (detect noise and record audio). Now, I have a list with the following words: help, please, yes, no, could, you, after, tomorrow
. I need an offline way to detect if my sound contains these words. Is this possible? How can I do that? I'm using linux and there is no way to change my operational system to windows or use virtual machine.
I'm thinking to use the sound's spectrogram, create a train database and use some classifier to predict. For example, this is a spectrogram of a word. Is this a good technique to use?
Thanks.