Highest Voted 'diarization' Questions

4

votes

1 answer

Efficient speaker diarization

I am running a VM instance on google cloud. My goal is to apply speaker diarization to several .wav files stored on cloud buckets. I have tried the following alternatives with the subsequent problems: Speaker diarization on Google's API. This seems…

asked Feb 15 '23 at 10:17

Luis

330
1
11

2

votes

1 answer

How can I count the number of people speaks in an audio file

I'm working on an audio project. My goal is to count the number of people who spokes in an audio file. We can consider that we already removed the noise from that audio.(for example, if there are two people talking in the audio the program can…

audio signal-processing speech-recognition librosa diarization

asked May 24 '22 at 09:08

Kacem ICHAKDI

23
5

2

votes

0 answers

Speaker Diarization using Resemblyzer

I am new to Speaker Diarization and was exploring Resemblyzer library and have a few questions. I looked at the diarization demo here: demo02_diarization.py Use live audio stream instead of static audio files: I see that the demo uses a static mp3…

speech-recognition speech-to-text diarization

asked Nov 21 '20 at 17:03

Darth.Vader

5,079
7
50
90

1

vote

2 answers

Diart (torchaudio) on Windows x64 results in torchaudio error "ImportError: FFmpeg libraries are not found. Please install FFmpeg."

I am giving a try to a speech diarization project named diart (based on hugging face models) I follow the instructions using a miniconda environment which are essentially: conda create -n diart python=3.8 conda activate diart conda install portaudio…

python pytorch conda torchaudio diarization

asked May 02 '23 at 14:19

LoneWanderer

3,058
1
23
41

1

vote

1 answer

Segmention instead of diarization for speaker count estimation

I'm using diarization of pyannote to determine the number of speakers in an audio, where number of speakers cannot be predetermined. Here is the code to determine speaker count by diarization: from pyannote.audio import Pipeline MY_TOKEN = "" #…

python-3.x machine-learning audio cluster-analysis diarization

asked Mar 24 '23 at 12:50

Digil

61
9

1

vote

0 answers

Extracting voice of different speakers in overlapping speech using pyannote

I am using Pyannote for speaker diarization. I am able to get the overlapping speech's start and end time but not able to do voice separation. Is there a way to use Pyannote for voice separation? If not, please suggest some way to do voice…

speech-recognition speech-to-text voice-recognition diarization

asked Oct 19 '22 at 08:11

vaibhav jain

83
4

1

vote

0 answers

AttributeError: 'NoneType' object has no attribute 'items' in pyannote speaker diarization package

When working with the pyannote python package from GitHub (tutorial link -> https://github.com/pyannote/pyannote-audio/blob/develop/tutorials/voice_activity_detection.ipynb) I receive the following error: enter image description here The…

database speaker diarization

asked Jun 13 '22 at 17:33

askrobola

11
1

1

vote

1 answer

torch.hub.load('pyannote/pyannote-audio', 'dia') doesn't work in local

I was using this code in google colab but it doesn't work when I want to use it in local OWN_FILE = {'audio': 'file.wav'} pipeline = torch.hub.load('pyannote/pyannote-audio', 'dia') diarization = pipeline(OWN_FILE) Here is the warnings I get for…

speech-recognition torch diarization

asked May 12 '21 at 16:01

Pierre-Louis VENTRE

61
2

0

votes

1 answer

Google Speech-to-Text API Speaker Diarization with Python .long_running_recognize() method

I was following the answer in this question. But my audio is more then 1 min so I have to use .long_running_recognize(config, audio) method instead .recognize(config, audio). Here is the code: from pathlib import Path # …

python google-cloud-platform audio speech-to-text diarization

asked Aug 05 '23 at 15:07

Vasyl Kolomiets

365
8
20

0

votes

1 answer

Azure speech-to-text speaker identification (or diarization): no text and no guests

I run this sample code from here, just changing the file name and the number of channels from eight to two (one channel is not supported). My goal is to test the speaker identification. Actually the diarization, since I'm not providing speakers…

python azure speech-to-text diarization

asked Jun 22 '23 at 15:30

oprog

126
1
8

0

votes

1 answer

Google Speech to text APIs returns only one side of the conversation

I am using Google APIs speech-to-text to transcript audio files (wav files) that are stored in GCS bucket. The audio files are phone records and have 3 speakers ( IVR, Customer, and Engineer) and the transcripts i get either when using diarization…

python google-api google-speech-to-text-api diarization

asked May 24 '23 at 19:05

Ahmed Fahmy

1

0

votes

0 answers

How to use pretrained UIS-RNN model?

google have UIS-RNN model for Speaker Diarization (https://github.com/google/uis-rnn). In their examples they show how to train from scratch the proposed model. Is it possible to used google pretrained UIS-RNN model ? (instead of training the model…

deep-learning voice diarization speaker-diarization

asked Apr 02 '23 at 07:51

user3668129

4,318
6
45
87

0

votes

0 answers

Extracting speech of different speakers in overlapping speech using pyannote

I am using Pyannote for speaker diarization. I am able to get the overlapping speech's start and end time but not able to do voice separation. Is there a way to use Pyannote for voice separation? If not, please suggest some way to do voice…

speech-recognition voice-recognition diarization

asked Mar 06 '23 at 10:46

Giray Yağmur

1

0

votes

1 answer

Can speech diarization be be integrated with deepspeech?

In an online meeting such as Google Meet/ Zoom, I want to detect change of speaker and then transcribe the audio for different speakers. I am using Deepspeech model for speech to text. I have fine-tuned the model for Indian accent english but I want…

deep-learning speech-to-text mozilla-deepspeech diarization

asked Oct 17 '22 at 07:24

vaibhav jain

83
4

0

votes

0 answers

How to split 1 channel audio into 2 channels?

I have an audio file with two speakers on 1 channel. I would like to separate the audio in 2 channels (one per speaker). I was thinking of splitting on silences, or more complicated things like speaker diarization to i.e. to detect different…

python audio diarization audio-channels

asked Apr 28 '22 at 07:54

Lucas

1
1

Questions tagged [diarization]