Questions tagged [speaker-diarization]
7 questions
1
vote
0 answers
Why am I getting "index 0 is out of bounds for axis 0 with size 0 when using pyAudioAnalysis library?
This question is about Speaker diarization. I'm trying to make a script that separates a mp4 file into different segments depending on different speakers. (The input mp4 file contains the dialogue of 4 different speakers)
The code is as…

RonaLightfoot
- 45
- 6
0
votes
0 answers
Way to Offline Speaker Diarization with Hugging Face
I am looking for Offline / locally saved model for speaker diarization with Hugging face without Authentication.
I have gone through google and found no relevant links for the same.
Is there any link/method to do the same?
Thanks in advance

san1
- 455
- 2
- 11
0
votes
0 answers
Adding speaker differentiation to Whisper API
I have been developing a Python application that takes an audio file and puts it through Whisper API and spits it out in a .docx file.
Does anyone know how to add speaker differentiation to it.
Example without speaker differentiation
Hey, have you…

Tom Hayward
- 1
- 1
0
votes
0 answers
Training Speaker_ID SpeechBrain
I am encountering difficulties while attempting to train the SpeechBrain speaker identification model using my own WAV files. I have made some modifications to the existing code provided in the train.yaml file, but I'm not sure if I'm even modifying…

clee
- 3
- 2
0
votes
0 answers
Speaker diarization with Node js using openai
I am doing whisper transcription in the node js using openai API. I am able to get the transcriptions for chunked files using ffmpeg library. I am struggling at the point of speaker diarization with node js. It would be great if get any guidance on…

Zeenath
- 329
- 1
- 5
- 22
0
votes
0 answers
How to add speaker labels in AWS Transcribe streaming websockets
I'm using the AWS Transcribe example from https://github.com/amazon-archives/amazon-transcribe-websocket-static with a simple modification on the websocket query-string to add speaker labels.
The current websocket URL path and querystring are…

trenta3
- 123
- 7
0
votes
0 answers
How to use pretrained UIS-RNN model?
google have UIS-RNN model for Speaker Diarization (https://github.com/google/uis-rnn).
In their examples they show how to train from scratch the proposed model.
Is it possible to used google pretrained UIS-RNN model ? (instead of training the model…

user3668129
- 4,318
- 6
- 45
- 87