Abstract—Most current speaker diarization systems use ag-glomerative clustering of Gaussian Mixture Models (GMMs) to determine “who spoke when ” in an audio recording. While state-of-the-art in accuracy, this method is computationally costly, mostly due to the GMM training, and thus limits the performance of current approaches to be roughly real-time. Increased sizes of current datasets require processing of hundreds of hours of data and thus make more efficient processing methods highly desirable. With the emergence of highly parallel multicore and manycore processors, such as graphics processing units (GPUs), one can re-implement GMM training to achieve faster than real-time performance by taking advantage of parallelism in the training c...
Guided source separation (GSS) is a type of target-speaker extraction method that relies on pre-comp...
This paper describes the effort with building speaker-clustered acoustic models as a part of the rea...
Gaussian Mixture Model (GMM) statistics are required for maximum likelihood training as well as for ...
Gaussian Mixture Model (GMM) computations in modern Automatic Speech Recognition systems are known t...
The goal in Speaker Diarization (SD) is to answer the question "Who spoke when?" for a given audio w...
International audienceSpeaker diarization is the task of partitioning an audio stream into homogeneo...
Automatic speech recognition (ASR) is a very demanding computing task. Much research has been done i...
Most of current state-of-the-art speaker verification (SV) sys-tems use Gaussian mixture model (GMM)...
International audienceThis paper proposes a method for segmenting and clustering an audio flow on th...
This paper describes the LIMSI speaker diarization system used in the RT-04F evaluation. The RT-04F ...
Speaker Diarization is the process of partitioning an audio input into homogeneous segments accordin...
The speaker diarization task consists of inferring “who spoke when ” in an audio stream without any ...
Performing speaker diarization of a collection of recordings, where speakers are uniquely identified...
Speaker diarization is the problem of determining "who spoke when" in an audio recording when the nu...
International audienceThis paper describes recent advances in speaker diarization by incorporating a...
Guided source separation (GSS) is a type of target-speaker extraction method that relies on pre-comp...
This paper describes the effort with building speaker-clustered acoustic models as a part of the rea...
Gaussian Mixture Model (GMM) statistics are required for maximum likelihood training as well as for ...
Gaussian Mixture Model (GMM) computations in modern Automatic Speech Recognition systems are known t...
The goal in Speaker Diarization (SD) is to answer the question "Who spoke when?" for a given audio w...
International audienceSpeaker diarization is the task of partitioning an audio stream into homogeneo...
Automatic speech recognition (ASR) is a very demanding computing task. Much research has been done i...
Most of current state-of-the-art speaker verification (SV) sys-tems use Gaussian mixture model (GMM)...
International audienceThis paper proposes a method for segmenting and clustering an audio flow on th...
This paper describes the LIMSI speaker diarization system used in the RT-04F evaluation. The RT-04F ...
Speaker Diarization is the process of partitioning an audio input into homogeneous segments accordin...
The speaker diarization task consists of inferring “who spoke when ” in an audio stream without any ...
Performing speaker diarization of a collection of recordings, where speakers are uniquely identified...
Speaker diarization is the problem of determining "who spoke when" in an audio recording when the nu...
International audienceThis paper describes recent advances in speaker diarization by incorporating a...
Guided source separation (GSS) is a type of target-speaker extraction method that relies on pre-comp...
This paper describes the effort with building speaker-clustered acoustic models as a part of the rea...
Gaussian Mixture Model (GMM) statistics are required for maximum likelihood training as well as for ...