Mozilla DeepSpeech is a TensorFlow implementation of Baidu's DeepSpeech architecture.
Questions tagged [mozilla-deepspeech]
103 questions
15
votes
2 answers
How to implement Mozilla DeepSpeech into PHP web app to convert Speech-to-text?
I have a PHP web application and am looking for an open source, high-accuracy speech-to-text recognition implementation that will take voice commands to open web pages from users. Examples: "Make Sales" (this will open Create Sales PHP page), "Make…

Priyesh
- 501
- 3
- 12
9
votes
2 answers
How do I get started training a custom voice model with Mozilla TTS on Ubuntu 20.04?
I'd like to create a custom voice in Mozilla TTS using audio samples I have recorded but am not sure how to get started. The Mozilla TTS project has documentation and tutorials, but I'm having trouble putting the pieces together -- it seems like…

GuyPaddock
- 2,233
- 2
- 23
- 27
6
votes
2 answers
Cannot install deepspeech of python
I want to use DeepSpeech of Mozilla on my Linux 22.04 system, following this website:
https://deepspeech.readthedocs.io/en/r0.9/?badge=latest
At the very beginning line, at
pip3 install deepspeech
I got this error:
ERROR: Could not find a version…

Reihaneh Kouhi
- 489
- 1
- 7
- 23
4
votes
2 answers
Trying to get DeepSpeech to work. SoX not found even though I pip installed it?
The error I'm getting is:
FileNotFoundError: [Errno 2] SoX not found, use 16000hz files or
install it: The system cannot find the file specified
My audio file is from an mp4 video file that was then converted to a WAV file from VLC. The sampling…

Vendolheim
- 51
- 2
4
votes
1 answer
ERROR installing deepspeech in ubuntu server
I was actually trying to install deepspeech to a raspberry 4 whith pip3 and python 3.6.9 when I got this error :
Could not find a version that satisfies the requirement deepspeech (from versions: )
No matching distribution found for deepspeech
when…
user12949921
4
votes
1 answer
How to install Mozilla's implementation of Baidu deep speech on Windows?
I'm on Windows 10, and when I try to install Mozilla's implementation of
Baidu Deep speech as shown here.
Using:
pip install deepspeech
I get this error : "Could not find a version that satisfies the requirement deepspeech (from versions:)
No…

Domy Kanyik-tesh
- 81
- 1
- 6
3
votes
1 answer
Converting Mozilla DeepSpeech model to use in tensorflow.js environment
I have been attempting to convert a Mozilla Deepspeech trained model for use in tensorflow.js, in ml5.js soundClassifier layer. My understanding is that Mozilla DeepSpeech uses TensorFlow. I have been trying to follow the hints found…

KevinHJ
- 1,014
- 11
- 24
3
votes
0 answers
My CTC loss model's loss stagnates and then outputs only blank characters
I am trying to implement BaiDu's DeepSpeech1 in keras using CTC loss, my code is below:
class dataGen(Sequence): # data generator for Mozilla common voice
def __init__(self, audiopaths, transcripts, batch_size):
self.x = audiopaths
…

hhaefliger
- 521
- 3
- 18
3
votes
2 answers
DeepSpeech giving bad results
I am new to DeepSpeech i followed this link to create Speech to text code, but my results are no where near to the original speech. I am using Deepspeech 0.6.1 and have installed the relevant pretrained model. I am using this link to create my wav…

Ironman
- 1,330
- 2
- 19
- 40
3
votes
1 answer
How to use mozilla deepspeech to convert speech to text using it's pre-trained model?
I want to convert speech to text using mozilla deepspeech. But the output is really bad.
I have downloaded mozilla's pre trained model and then what i have done is this:
BEAM_WIDTH = 500
LM_WEIGHT = 1.50
VALID_WORD_COUNT_WEIGHT =…

Amit Joshi
- 61
- 1
- 1
- 6
2
votes
1 answer
Mozilla DeepSpeech: How to generate a SRT file from multiple segmented audio file?
I've been following this guide on generating an SRT subtitle file from video/audio files using Mozilla DeepSpeech.
I've been able to remove the silent portion of the audio .wav file into multiple segmented .wav files based on the guide using…

SingularitySG
- 41
- 1
- 3
2
votes
1 answer
tensorflow dependencies continuously gives me errors in colab during installation of deepspeech environment
when I run the following command on Google Colab
!pip3 install --upgrade pip==20.0.2 wheel==0.34.2 setuptools==46.1.3
!pip3 install --upgrade --force-reinstall -e .
Got an error
ERROR: pip's dependency resolver does not currently take into…

chamod rathnayake
- 775
- 8
- 15
2
votes
2 answers
DeepSpeech failed to learn Persian language
I’m training DeepSpeech from scratch (without checkpoint) with a language model generated using KenLM as stated in its doc. The dataset is a Common Voice dataset for Persian language.
My configurations are as follows:
Batch size = 2 (due to cuda…

Soroush
- 83
- 8
2
votes
0 answers
Is there a way to continuously send snippets of audio being recorded in realtime to backend server in Flutter.io?
I am creating an application that uses Mozilla's Deep Speech API to transcribe the user's speech to text. The input requires audio files with some sort of format and in order for this app to work, I will need to continuously send these audio files…

frlzjosh
- 410
- 5
- 17
2
votes
1 answer
Can I install wheel package in buildroot?
I want to install deepspeech on my embedded device (don't worry about power requirements, I intend to deal with that later).
However I failed several times at installing it. For what I've seen it is because there is only wheel files available for…