Questions tagged [speaker-diarization]

7 questions
1
vote
0 answers

Why am I getting "index 0 is out of bounds for axis 0 with size 0" when using the pyAudioAnalysis library?

This question is about speaker diarization. I'm trying to write a script that splits an mp4 file into segments, one per speaker. (The input mp4 file contains dialogue between 4 different speakers.) The code is as…
0
votes
0 answers

How to do offline speaker diarization with Hugging Face

I am looking for an offline, locally saved model for speaker diarization with Hugging Face that works without authentication. I have searched Google and found no relevant links. Is there a link or method for doing this? Thanks in advance
0
votes
0 answers

Adding speaker differentiation to Whisper API

I have been developing a Python application that takes an audio file, runs it through the Whisper API, and writes the result to a .docx file. Does anyone know how to add speaker differentiation to it? Example without speaker differentiation: Hey, have you…
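One common way to approach this kind of speaker labelling is to run diarization separately and then attach each transcript segment to the speaker turn it overlaps most in time. The following is a minimal sketch under stated assumptions, not the asker's code: it assumes the transcript is already available as timestamped segments and that speaker turns come from some other diarization tool. The names `Segment`, `overlap`, and `label_transcript` are hypothetical and introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """A time span in seconds with a label (transcript text or speaker name)."""
    start: float
    end: float
    label: str


def overlap(a: Segment, b: Segment) -> float:
    """Length in seconds of the time shared by two segments (0.0 if disjoint)."""
    return max(0.0, min(a.end, b.end) - max(a.start, b.start))


def label_transcript(transcript, turns, unknown="UNKNOWN"):
    """Pair each transcript segment with the speaker turn it overlaps most.

    `transcript` holds Segments whose label is the spoken text; `turns` holds
    Segments whose label is a speaker name. Both are assumed inputs produced
    elsewhere. Segments with no overlapping turn get the `unknown` speaker.
    """
    labelled = []
    for seg in transcript:
        best = max(turns, key=lambda t: overlap(seg, t), default=None)
        if best is not None and overlap(seg, best) > 0:
            speaker = best.label
        else:
            speaker = unknown
        labelled.append((speaker, seg.label))
    return labelled


if __name__ == "__main__":
    # Made-up example data, purely for illustration.
    transcript = [Segment(0.0, 2.0, "Hey"), Segment(2.0, 4.5, "Hi there")]
    turns = [Segment(0.0, 2.2, "SPEAKER_00"), Segment(2.2, 5.0, "SPEAKER_01")]
    for speaker, text in label_transcript(transcript, turns):
        print(f"{speaker}: {text}")
```

Maximum overlap is only one possible assignment rule; a transcript segment that spans a speaker change would still receive a single label under this sketch.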
0
votes
0 answers

Training Speaker_ID SpeechBrain

I am encountering difficulties while attempting to train the SpeechBrain speaker identification model using my own WAV files. I have made some modifications to the existing code provided in the train.yaml file, but I'm not sure if I'm even modifying…
0
votes
0 answers

Speaker diarization with Node.js using OpenAI

I am doing Whisper transcription in Node.js using the OpenAI API. I am able to get transcriptions for files chunked with the ffmpeg library, but I am struggling with speaker diarization in Node.js. It would be great if I could get any guidance on…
0
votes
0 answers

How to add speaker labels in AWS Transcribe streaming websockets

I'm using the AWS Transcribe example from https://github.com/amazon-archives/amazon-transcribe-websocket-static with a simple modification to the WebSocket query string to add speaker labels. The current WebSocket URL path and query string are…
0
votes
0 answers

How to use pretrained UIS-RNN model?

Google has a UIS-RNN model for speaker diarization (https://github.com/google/uis-rnn). Their examples show how to train the proposed model from scratch. Is it possible to use Google's pretrained UIS-RNN model? (instead of training the model…
user3668129