Highest Voted 'speaker-diarization' Questions

1

vote

0 answers

Why am I getting "index 0 is out of bounds for axis 0 with size 0" when using the pyAudioAnalysis library?

This question is about speaker diarization. I'm trying to write a script that splits an MP4 file into segments by speaker. (The input MP4 file contains dialogue from 4 different speakers.) The code is as…

python pyaudioanalysis speaker-diarization

asked Aug 23 '23 at 04:11

RonaLightfoot

45
6

0

votes

0 answers

How to do offline speaker diarization with Hugging Face

I am looking for an offline (locally saved) model for speaker diarization with Hugging Face that works without authentication. I have searched Google and found no relevant links. Is there any link or method for doing this? Thanks in advance.

python huggingface-transformers huggingface speaker-diarization

asked Jul 26 '23 at 09:28

san1

455
2
11

0

votes

0 answers

Adding speaker differentiation to Whisper API

I have been developing a Python application that takes an audio file, sends it through the Whisper API, and writes the result to a .docx file. Does anyone know how to add speaker differentiation to it? Example without speaker differentiation: Hey, have you…

python openai-whisper speaker-diarization

asked Jul 24 '23 at 14:20

Tom Hayward

1
1

0

votes

0 answers

Training Speaker_ID SpeechBrain

I am encountering difficulties while attempting to train the SpeechBrain speaker identification model using my own WAV files. I have made some modifications to the existing code provided in the train.yaml file, but I'm not sure if I'm even modifying…

speech-recognition speech speechbrain speaker-diarization

asked Jun 27 '23 at 16:52

clee

3
2

0

votes

0 answers

Speaker diarization with Node.js using OpenAI

I am doing Whisper transcription in Node.js using the OpenAI API. I am able to get transcriptions for chunked files using the ffmpeg library. I am struggling with speaker diarization in Node.js. It would be great if I could get any guidance on…

node.js speech-to-text openai-api openai-whisper speaker-diarization

asked Jun 27 '23 at 06:45

Zeenath

329
1
5
22

0

votes

0 answers

How to add speaker labels in AWS Transcribe streaming websockets

I'm using the AWS Transcribe example from https://github.com/amazon-archives/amazon-transcribe-websocket-static with a simple modification to the websocket query string to add speaker labels. The current websocket URL path and query string are…

amazon-web-services websocket amazon-transcribe speaker-diarization

asked Jun 01 '23 at 19:39

trenta3

123
7

0

votes

0 answers

How to use pretrained UIS-RNN model?

Google has a UIS-RNN model for speaker diarization (https://github.com/google/uis-rnn). Their examples show how to train the proposed model from scratch. Is it possible to use Google's pretrained UIS-RNN model (instead of training the model…

deep-learning voice diarization speaker-diarization

asked Apr 02 '23 at 07:51

user3668129

4,318
6
45
87
