The error I'm getting is:
FileNotFoundError: [Errno 2] SoX not found, use 16000hz files or install it: The system cannot find the file specified
My audio file is from an mp4 video file that was then converted to a WAV file from VLC. The sampling rate is 8000 Hz by default, and it appears that DeepSpeech needs 16kHz files and therefore the SoX plugin.
I ran pip install SoX and pip install --upgrade SoX
.
Requirement already satisfied: SoX in e:\downloads\deep speech\lib\site-packages (1.4.1)
Requirement already satisfied: numpy>=1.9.0 in e:\downloads\deep speech\lib\site-packages (from SoX) (1.21.4)
So it's here. I then added E:\Downloads\Deep Speech\Lib\site-packages to system environment variables on Windows just in case. I'm new to Python in general and stumped here.
Could someone give me a hand?