Questions tagged [vosk]

Questions about Vosk Speech Recognition Library

Vosk is a speech recognition toolkit. Its main strengths are:

Supports 9 languages: English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese. More to come.
Works offline, even on lightweight devices: Raspberry Pi, Android, iOS.
Installs with a simple pip3 install vosk.
Portable per-language models are only 50 MB each, and much bigger server models are also available.
Provides a streaming API for the best user experience (unlike popular speech-recognition Python packages).
Has bindings for other programming languages too: Java, C#, JavaScript, etc.
Allows quick reconfiguration of the vocabulary for best accuracy.
Supports speaker identification in addition to plain speech recognition.
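
As a rough illustration of the streaming API mentioned above, here is a minimal Python sketch. It assumes a recognizer object that behaves like vosk.KaldiRecognizer (AcceptWaveform, Result and FinalResult). The helper names transcribe_chunks, read_wav_chunks and open_recognizer are made up for this example and are not part of Vosk, and the WAV file is assumed to be 16-bit mono PCM.

```python
import json
import wave


def transcribe_chunks(recognizer, chunks):
    """Feed audio chunks to a recognizer and collect finished utterances.

    `recognizer` is assumed to behave like vosk.KaldiRecognizer:
    AcceptWaveform(bytes) is truthy once an utterance is complete, and
    Result() / FinalResult() return JSON strings with a "text" field.
    """
    utterances = []
    for chunk in chunks:
        if recognizer.AcceptWaveform(chunk):
            text = json.loads(recognizer.Result()).get("text", "")
            if text:
                utterances.append(text)
    # Flush whatever audio is still buffered after the last chunk.
    tail = json.loads(recognizer.FinalResult()).get("text", "")
    if tail:
        utterances.append(tail)
    return utterances


def read_wav_chunks(path, frames_per_chunk=4000):
    """Yield raw PCM chunks from a WAV file (assumed 16-bit mono)."""
    with wave.open(path, "rb") as wav:
        while True:
            data = wav.readframes(frames_per_chunk)
            if not data:
                break
            yield data


def open_recognizer(model_dir, sample_rate):
    """Build a real recognizer; needs `pip3 install vosk` and a model folder."""
    from vosk import Model, KaldiRecognizer  # imported lazily on purpose
    return KaldiRecognizer(Model(model_dir), sample_rate)
```

With Vosk installed and a model unpacked locally, something like transcribe_chunks(open_recognizer("model", 16000), read_wav_chunks("speech.wav")) should return a list of recognized utterances. Both paths are placeholders, and the sample rate has to match the audio file.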

65 questions

0 answers

How to modify or edit the vosk dictionary?

I'm using Vosk for speech recognition. Does anyone know where the vocabulary dictionary is located and how to edit it to add or remove words? Some background on my project: I'm working on a linguistic AI project. I needed a speech recognition…

dictionary speech-recognition vosk

asked Apr 24 '20 at 17:28

James

1 answer

Vosk-api python for speech-recognition. Feature for google-like speech adaption?

So Vosk-api is a brilliant offline speech recogniser with brilliant support, but with very poor (or smartly hidden) documentation at the time of this post (14 Aug 2020). The question is: is there any kind of replacement of…

python-3.x google-speech-to-text-api vosk

asked Aug 14 '20 at 11:06

George Ivannikov

1 answer

How to implement multi-language models in VOSK?

I wondered how we can implement multi-language processing in an application with the Vosk library. I want to make an application that supports multiple languages such as Persian, Kurdish, and English. The programming language that I want to use is Java…

speech-to-text vosk

asked Jan 29 '22 at 20:02

aniran mohammadpour

1 answer

Where should I put the model files of VOSK speech recognition in Java? ERROR (VoskAPI:Model():model.cc:122)

I have tried to use VOSK but get this error: ERROR (VoskAPI:Model():model.cc:122) Folder 'fa' does not contain model files. Make sure you specified the model path properly in Model constructor. If you are not sure about relative path, use absolute…

java vosk

asked Oct 24 '21 at 09:28

aniran mohammadpour

1 answer

How to use Wave file as input in VOSK speech recognition?

I have a project that needs to take a recorded file, process it in code, extract the text from the file, match the extracted text against another text, and verify it. My problem is: I can't use the recorded file in code and it doesn't read the…

python speech-recognition vosk

asked Jun 29 '21 at 09:07

Ebrahim Mohammadi

1 answer

How can I get the start and end times of words in an audio file with a known transcript using Vosk?

I'm using Vosk (https://alphacephei.com/vosk/) in Python and I want to get the start and end times of every word in an audio file, and I have the transcript of the audio file. I'm using some code I found online to perform speech-to-text using Vosk,…

python speech-recognition speech-to-text vosk

asked Nov 27 '22 at 02:44

Jason Maldonis

1 answer

VOSK empty text output? (Java)

When I am trying to get output from the code below, which produces a text file from a WAV file transferred from client to server through RecordRTC: LibVosk.setLogLevel(LogLevel.DEBUG); try (Model model = new Model("model"); …

java spring vosk

asked Oct 24 '21 at 11:04

aniran mohammadpour

1 answer

vosk in python: get location of transcribed text in audio file

Using a file very similar to test_ffmpeg.py in the Vosk repository, I am exploring what text information I can get out of the audio file. Here is the code of the whole script I'm using. #!/usr/bin/env python3 from vosk import Model,…

python speech-recognition speech-to-text vosk

asked Oct 11 '21 at 15:57

chrisroode

1 answer

How to set a wake-up word for a virtual assistant using Vosk offline speech recognition (or any other fast offline speech recognizer)

I want a fast offline speech recognizer (like Vosk or Sphinx) to detect a wake-up word for Google speech recognition, so that it doesn't always hear what we say. I want a wake-up word for this: def takeCommand(): # mic input r = sr.Recognizer() with sr.Microphone()…

python artificial-intelligence speech-recognition vosk

asked Sep 14 '21 at 07:42

Snipey

2 answers

Use the microphone in Java for speech recognition with VOSK

I am trying to add real-time speech recognition to my Java project (preferably offline). Through some googling and trying other solutions, I settled on using VOSK for my speech recognition. The primary problem I am encountering, however, is that…

java audio speech-recognition microphone vosk

asked Jul 15 '21 at 22:08

Squalmals

1 answer

Cannot compile .c because it cannot find .h file

I'm on Ubuntu 18.04 and I'm trying to compile a .c file that came with an API I'm working with, called Vosk. The issue is that the Python code works without any problems, but if I try to run gcc test_vosk.c -o test_vosk, the .c file that they…

linux kaldi vosk

asked Jul 15 '21 at 07:48

Birto

1 answer

voice recognition constantly in background android

I want to develop an application that does something whenever it recognizes a keyword. It needs to be in listening mode all the time, in the background too. I was exposed to this and this. I tried to run it but it does not work when I am…

java android voice-recognition voice vosk

asked Sep 13 '20 at 08:06

Yaffa Harari

1 answer

Vosk (Kaldi) offline speech recognition in Unity

How can I integrate and use the Vosk library in a Unity project? Please write the steps 1, 2, 3... The Vosk library is here: https://github.com/alphacep/vosk-api

unity-game-engine speech-recognition offline kaldi vosk

asked Aug 14 '20 at 06:13

user14103335

0 answers

How to get WAV audio from a microphone in Go

My program uses the Go bindings of the Vosk speech recognition library, which takes in the audio as byte slices of WAV mono audio. My program currently uses the external command arecord to get WAV audio from the microphone but I'd prefer to do it in…

go wav microphone audio-capture vosk

asked Apr 24 '23 at 22:30

GGG

1 answer

Using Services to create a Speech to Text Function with Vosk?

I wanted to get some additional opinions on a project. I'm attempting to create a system which will gather the dialog from a call started by Twilio Studio and transcribe it. I then intend to push the transcribed dialog to my CRM. I found a…

twilio vosk

asked Jan 26 '23 at 15:50

Luis R
