The following article presents a novel, adaptive initialization scheme that can be applied to most state-of-the-art Speaker Diarization algorithms, i.e. algorithms that use agglomerative hierarchical clustering with the Bayesian Information Criterion (BIC) and Gaussian Mixture Models (GMMs) of frame-based cepstral features (MFCCs). The initialization method combines the recently proposed “adaptive seconds per Gaussian” (ASPG) method with a new method, based on prosodic features, for pre-clustering and for estimating the number of initial clusters. The presented initialization method has two important advantages. First, the method requires no manual tuning and is robust against file length and speaker count variations. Second, the method outperforms ...
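As an illustration of the BIC-based merge decision that such agglomerative systems rely on, the following is a minimal sketch, not the article's implementation. It assumes one diagonal-covariance Gaussian per cluster instead of a GMM, and the names `diag_var`, `delta_bic` and `penalty_weight` are hypothetical. A positive value favours keeping two clusters separate; a negative value marks them as merge candidates.

```python
import math
import random


def diag_var(frames):
    """Per-dimension maximum-likelihood variance of a list of feature vectors."""
    n = len(frames)
    dim = len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dim)]
    # A small floor keeps log() defined for constant dimensions.
    return [
        max(sum((f[d] - means[d]) ** 2 for f in frames) / n, 1e-8)
        for d in range(dim)
    ]


def sum_log_var(frames):
    """Sum of log variances, i.e. log-determinant of the diagonal covariance."""
    return sum(math.log(v) for v in diag_var(frames))


def delta_bic(a, b, penalty_weight=1.0):
    """BIC of modelling a and b separately minus BIC of one shared model.

    Assumption: each cluster is a single diagonal-covariance Gaussian,
    so two models need 2 * dim more parameters than one.
    """
    n_a, n_b = len(a), len(b)
    n = n_a + n_b
    dim = len(a[0])
    # Log-likelihood gain from using two Gaussians instead of one.
    gain = 0.5 * (
        n * sum_log_var(a + b) - n_a * sum_log_var(a) - n_b * sum_log_var(b)
    )
    # Complexity penalty for the extra means and variances.
    penalty = 0.5 * penalty_weight * (2 * dim) * math.log(n)
    return gain - penalty


# Usage with synthetic two-dimensional "features" (stand-ins for MFCC frames).
random.seed(0)
speaker_1 = [[random.gauss(0, 1), random.gauss(0, 1)] for _ in range(200)]
speaker_2 = [[random.gauss(5, 1), random.gauss(5, 1)] for _ in range(200)]
score = delta_bic(speaker_1, speaker_2)
```

In an agglomerative loop, the pair of clusters with the lowest score would be merged first, and merging would stop once no pair scores below zero.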
This thesis describes research into speaker diarization for recorded meetings. It explores the algor...
Speaker diarization includes two steps: speaker segmentation and speaker clustering. Speaker segment...
This paper aims at investigating the use of sequential clustering for speaker diarization. Conventio...
This paper investigates a typical speaker diarization system regarding its robustness against initia...
Speaker Diarization is the process of partitioning an audio input into homogeneous segments accordin...
We investigate a state-of-the-art Speaker Diarization system regarding its behavior on meetings that...
The introduction of factor analysis techniques in a speaker diarization system enhances its performa...
This paper describes the LIMSI speaker diarization system used in the RT-04F evaluation. The RT-04F ...
The task of speaker diarization consists of answering the question “Who spoke when?”. The most comm...
This paper describes recent advances in speaker diarization by incorporating a...
We present a novel model adaptation approach to deal with data variability for speaker diarization i...
Two new features have been proposed and used in the Rich Transcription Evaluation 2009 by the Univer...
The ever-expanding volume of available audio and multimedia data has elevated technologies related t...
This paper proposes the use of the Bayes Factor to replace the Bayesian Information Criterion (BIC) ...