i-vectors have been successfully applied over the last years in speaker recognition tasks. This work aims at assessing the suitability of i-vector modeling within the frame of speaker diarization task. In such context, a weighted cosine-distance between two different sets of i-vectors is proposed for speaker clustering. Speech clusters generated by Viterbi segmentation are first modeled by two different i-vectors. Whilst the first i-vector represents the distribution of the commonly used short-term Mel Frequency Cepstral Coefficients, the second one depicts a selection of voice quality and prosodic features. In order to combine both short- and long-term speech statistics, the cosine-distance scores of those two i-vectors are linearly weight...
The purpose of this study is to develop robust techniques for speaker segmentation and clustering wi...
In this paper, we present a hybrid speaker-based segmentation, which combines metric-based and model...
Gaussian mixture models (GMMs) are commonly used in text-independent speaker identification systems....
i-vectors have been successfully applied over the last years in speaker recognition tasks. This work...
i-vector modeling techniques have been successfully used for speaker clustering task recently. In th...
i-vector modeling techniques have been successfully used for speaker clustering task recently. In th...
Abstract Several factors contribute to the performance of speaker diarization systems. For instance,...
In this paper we present our system for speaker diarization of broad-cast news based on recent advan...
International audienceWe propose to study speaker diarization from a collection of audio documents. ...
This paper extends upon our previous work using i-vectors for speaker diarization. We examine the ef...
Speaker diarization has received several research attentions over the last decade. Among the differe...
UnrestrictedSpeaker clustering refers to a process of classifying a set of input speech data (or spe...
The development in the interface of smart devices has lead to voice interactive systems. An addition...
The process of manually labeling data is very expensive and sometimes infeasible due to privacy and ...
In this paper, we present an application of student’s t-test to measure the similarity between two s...
The purpose of this study is to develop robust techniques for speaker segmentation and clustering wi...
In this paper, we present a hybrid speaker-based segmentation, which combines metric-based and model...
Gaussian mixture models (GMMs) are commonly used in text-independent speaker identification systems....
i-vectors have been successfully applied over the last years in speaker recognition tasks. This work...
i-vector modeling techniques have been successfully used for speaker clustering task recently. In th...
i-vector modeling techniques have been successfully used for speaker clustering task recently. In th...
Abstract Several factors contribute to the performance of speaker diarization systems. For instance,...
In this paper we present our system for speaker diarization of broad-cast news based on recent advan...
International audienceWe propose to study speaker diarization from a collection of audio documents. ...
This paper extends upon our previous work using i-vectors for speaker diarization. We examine the ef...
Speaker diarization has received several research attentions over the last decade. Among the differe...
UnrestrictedSpeaker clustering refers to a process of classifying a set of input speech data (or spe...
The development in the interface of smart devices has lead to voice interactive systems. An addition...
The process of manually labeling data is very expensive and sometimes infeasible due to privacy and ...
In this paper, we present an application of student’s t-test to measure the similarity between two s...
The purpose of this study is to develop robust techniques for speaker segmentation and clustering wi...
In this paper, we present a hybrid speaker-based segmentation, which combines metric-based and model...
Gaussian mixture models (GMMs) are commonly used in text-independent speaker identification systems....