Questions tagged [vosk]

Questions about Vosk Speech Recognition Library

Vosk is a speech recognition toolkit. The best things in Vosk are:

  • Supports 9 languages - English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese. More to come.
  • Works offline, even on lightweight devices - Raspberry Pi, Android, iOS
  • Installs with simple pip3 install vosk
  • Portable per-language models are only 50Mb each, but there are much bigger server models available.
  • Provides streaming API for the best user experience (unlike popular speech-recognition python packages)
  • There are bindings for different programming languages, too - java/csharp/javascript etc.
  • Allows quick reconfiguration of vocabulary for best accuracy.
  • Supports speaker identification beside simple speech recognition.
65 questions
14
votes
0 answers

How to modify or edit the vosk dictionary?

I'm using vosk for speech recognition. Does anyone know where the vocabulary dictionary is located and how to edit it to add or remove words? Some Background on my project: I'm working on a Linguistic AI project. I needed a speech recognition…
James
  • 181
  • 2
  • 6
4
votes
1 answer

Vosk-api python for speech-recognition. Feature for google-like speech adaption?

So Vosk-api is a brilliant offline speech recogniser with brilliant support, however with very poor (or smartly hidden) documentation, at the moment of this post (14 Aug, 2020) The question is: is there any kind of replacement of…
3
votes
1 answer

How to implement multi languages models in VOSK?

I wondered how we can implement multi-language processing in an application with the Vosk library. I want to make an application that supports multi-languages like Persian, Kurdish, and English. The programming language that I want to use is Java…
3
votes
1 answer

Where should i put Model files of VOSK speech recognition in java? ERROR (VoskAPI:Model():model.cc:122)

I have tried to use VOSK but get this error: ERROR (VoskAPI:Model():model.cc:122) Folder 'fa' does not contain model files. Make sure you specified the model path properly in Model constructor. If you are not sure about relative path, use absolute…
3
votes
1 answer

How to use Wave file as input in VOSK speech recognition?

I have a project that needs to get a recorded file and then process by the code and extract the text from file and match the extracted file with the other text and verify it. my problem is: I can't use recorded file in code and it does'nt read the…
2
votes
1 answer

How can I get the start and end times of words in an audio file with a known transcript using Vosk?

I'm using Vosk (https://alphacephei.com/vosk/) in Python and I want to get the start and end times of every word in an audio file, and I have the transcript of the audio file. I'm using some code I found online to perform speech-to-text using Vosk,…
Jason Maldonis
  • 307
  • 3
  • 10
2
votes
1 answer

VOSK empty text output?(Java)

When i am trying to get output from the code below, that gives text file from Wave file that transfered from client to server through RecordRTC, LibVosk.setLogLevel(LogLevel.DEBUG); try (Model model = new Model("model"); …
2
votes
1 answer

vosk in python: get location of transcribed text in audio file

Using a file very similar to test_ffmpeg.py in the Vosk repository, I am exploring what text information I can get out of the audio file. Here is the code of the whole script I'm using. #!/usr/bin/env python3 from vosk import Model,…
chrisroode
  • 77
  • 10
2
votes
1 answer

How to set a wake up word for an virtual assistant using Vosk offline speech recognition (or any other fast offline speech recognizer)

I want a fast offline speech recognizer (like vosk or sphinx) as a wake-up word for google speech recognition so it doesn't always hear what we say. Want wake up word for this: def takeCommand(): # mic input r = sr.Recognizer() with sr.Microphone()…
2
votes
2 answers

Use the microphone in java for speech recognition with VOSK

I am trying to add real-time speech recognition to my java project (preferably offline). Through some googling and trying other solutions, I settled on using VOSK for my speech recognition. The primary problem I am encountering, however, is that…
Squalmals
  • 23
  • 1
  • 3
2
votes
1 answer

Cannot comiple .c because it cannot find .h file

I'm on ubuntu 18.04 and i'm trying to compile a .c file that came with an API that i'm working which is called vosk. The issue is that the python code works without any problems but if i try to gcc test_vosk.c -o test_vosk the .c file that they…
Birto
  • 71
  • 5
2
votes
1 answer

voice recognition constantly in background android

FfhggtffgtfgtgftI want develop application that whenever it recognizes a keyword it does something. it needs to be in listening mode all the time, in backgeound too. I was exposed to this and this. I treid run it but it is not work when I am…
Yaffa Harari
  • 43
  • 1
  • 6
2
votes
1 answer

Vosk (Kaldi) offline speech recognition in Unity

How to implement and use Vosk library into Unity project? Please write steps 1,2,3... Vosk library here - https://github.com/alphacep/vosk-api
1
vote
0 answers

How to get WAV audio from a microphone in Go

My program uses the Go bindings of the Vosk speech recognition library, which takes in the audio as byte slices of WAV mono audio. My program currently uses the external command arecord to get WAV audio from the microphone but I'd prefer to do it in…
GGG
  • 295
  • 1
  • 9
1
vote
1 answer

Using Services to create a Speech to Text Function with Vosk?

wanted to get some additional opinions on a project. I'm attempting to create a system which will gather the dialog from a call commenced by Twilio Studio and transcribe it. I then intend to push the now transcribed dialog to my CRM. I found a…
Luis R
  • 11
  • 2
1
2 3 4 5