Questions tagged [mozilla-deepspeech]

Mozilla DeepSpeech is a TensorFlow implementation of Baidu's DeepSpeech architecture.

Code: https://github.com/mozilla/DeepSpeech

103 questions
15
votes
2 answers

How to implement Mozilla DeepSpeech into PHP web app to convert Speech-to-text?

I have a PHP web application and am looking for an open source, high-accuracy speech-to-text recognition implementation that will take voice commands to open web pages from users. Examples: "Make Sales" (this will open Create Sales PHP page), "Make…
9
votes
2 answers

How do I get started training a custom voice model with Mozilla TTS on Ubuntu 20.04?

I'd like to create a custom voice in Mozilla TTS using audio samples I have recorded but am not sure how to get started. The Mozilla TTS project has documentation and tutorials, but I'm having trouble putting the pieces together -- it seems like…
GuyPaddock
  • 2,233
  • 2
  • 23
  • 27
6
votes
2 answers

Cannot install deepspeech of python

I want to use DeepSpeech of Mozilla on my Linux 22.04 system, following this website: https://deepspeech.readthedocs.io/en/r0.9/?badge=latest At the very beginning line, at pip3 install deepspeech I got this error: ERROR: Could not find a version…
Reihaneh Kouhi
  • 489
  • 1
  • 7
  • 23
4
votes
2 answers

Trying to get DeepSpeech to work. SoX not found even though I pip installed it?

The error I'm getting is: FileNotFoundError: [Errno 2] SoX not found, use 16000hz files or install it: The system cannot find the file specified My audio file is from an mp4 video file that was then converted to a WAV file from VLC. The sampling…
Vendolheim
  • 51
  • 2
4
votes
1 answer

ERROR installing deepspeech in ubuntu server

I was actually trying to install deepspeech to a raspberry 4 whith pip3 and python 3.6.9 when I got this error : Could not find a version that satisfies the requirement deepspeech (from versions: ) No matching distribution found for deepspeech when…
user12949921
4
votes
1 answer

How to install Mozilla's implementation of Baidu deep speech on Windows?

I'm on Windows 10, and when I try to install Mozilla's implementation of Baidu Deep speech as shown here. Using: pip install deepspeech I get this error : "Could not find a version that satisfies the requirement deepspeech (from versions:) No…
3
votes
1 answer

Converting Mozilla DeepSpeech model to use in tensorflow.js environment

I have been attempting to convert a Mozilla Deepspeech trained model for use in tensorflow.js, in ml5.js soundClassifier layer. My understanding is that Mozilla DeepSpeech uses TensorFlow. I have been trying to follow the hints found…
KevinHJ
  • 1,014
  • 11
  • 24
3
votes
0 answers

My CTC loss model's loss stagnates and then outputs only blank characters

I am trying to implement BaiDu's DeepSpeech1 in keras using CTC loss, my code is below: class dataGen(Sequence): # data generator for Mozilla common voice def __init__(self, audiopaths, transcripts, batch_size): self.x = audiopaths …
3
votes
2 answers

DeepSpeech giving bad results

I am new to DeepSpeech i followed this link to create Speech to text code, but my results are no where near to the original speech. I am using Deepspeech 0.6.1 and have installed the relevant pretrained model. I am using this link to create my wav…
Ironman
  • 1,330
  • 2
  • 19
  • 40
3
votes
1 answer

How to use mozilla deepspeech to convert speech to text using it's pre-trained model?

I want to convert speech to text using mozilla deepspeech. But the output is really bad. I have downloaded mozilla's pre trained model and then what i have done is this: BEAM_WIDTH = 500 LM_WEIGHT = 1.50 VALID_WORD_COUNT_WEIGHT =…
Amit Joshi
  • 61
  • 1
  • 1
  • 6
2
votes
1 answer

Mozilla DeepSpeech: How to generate a SRT file from multiple segmented audio file?

I've been following this guide on generating an SRT subtitle file from video/audio files using Mozilla DeepSpeech. I've been able to remove the silent portion of the audio .wav file into multiple segmented .wav files based on the guide using…
SingularitySG
  • 41
  • 1
  • 3
2
votes
1 answer

tensorflow dependencies continuously gives me errors in colab during installation of deepspeech environment

when I run the following command on Google Colab !pip3 install --upgrade pip==20.0.2 wheel==0.34.2 setuptools==46.1.3 !pip3 install --upgrade --force-reinstall -e . Got an error ERROR: pip's dependency resolver does not currently take into…
2
votes
2 answers

DeepSpeech failed to learn Persian language

I’m training DeepSpeech from scratch (without checkpoint) with a language model generated using KenLM as stated in its doc. The dataset is a Common Voice dataset for Persian language. My configurations are as follows: Batch size = 2 (due to cuda…
2
votes
0 answers

Is there a way to continuously send snippets of audio being recorded in realtime to backend server in Flutter.io?

I am creating an application that uses Mozilla's Deep Speech API to transcribe the user's speech to text. The input requires audio files with some sort of format and in order for this app to work, I will need to continuously send these audio files…
2
votes
1 answer

Can I install wheel package in buildroot?

I want to install deepspeech on my embedded device (don't worry about power requirements, I intend to deal with that later). However I failed several times at installing it. For what I've seen it is because there is only wheel files available for…
1
2 3 4 5 6 7