In this paper we propose and evaluate a speaker attribution system using a complete-linkage clustering method. Speaker attribution refers to the annotation of a collection of spoken audio based on speaker identities. This can be achieved using diarization and speaker linking. The main challenge associated with attribution is achieving computational efficiency when dealing with large audio archives. Traditional agglomerative clustering methods with model merging and retraining are not feasible for this purpose. This has motivated the use of linkage clustering methods without retraining. We first propose a diarization system using complete-linkage clustering and show that it outperforms traditional agglomerative and single-linkage clustering ...
The purpose of this study is to develop robust techniques for speaker segmentation and clustering wi...
International audienceThis paper describes recent advances in speaker diarization by incorporating a...
Forensic audio does not seldom consist of long recordings of multiple speakers engaged in a dialogue...
This research makes a major contribution which enables efficient searching and indexing of large arc...
In this paper we extend the concept of speaker annotation within a single-recording, or speaker diar...
Performing speaker diarization of a collection of recordings, where speakers are uniquely identified...
We present a clustering-only approach to the problem of speaker diarization to eliminate the need fo...
Speaker diarization is the problem of determining "who spoke when" in an audio recording when the nu...
Abstract Speaker diarization of a collection of recordings with uniquely identified speakers is a ch...
Speaker diarization of a collection of recordings with uniquely identified speakers is a challenging...
This paper investigates the problem of automatically grouping unknown speech utterances based on the...
International audienceAbstract:This paper describes recent advances in speaker diarization with a mu...
Speaker diarization is the process of annotating an input audio with information that attributes tem...
The goal in Speaker Diarization (SD) is to answer the question "Who spoke when?" for a given audio w...
In this paper we propose a novel scheme for carrying out speaker diarization in an iterative manner....
The purpose of this study is to develop robust techniques for speaker segmentation and clustering wi...
International audienceThis paper describes recent advances in speaker diarization by incorporating a...
Forensic audio does not seldom consist of long recordings of multiple speakers engaged in a dialogue...
This research makes a major contribution which enables efficient searching and indexing of large arc...
In this paper we extend the concept of speaker annotation within a single-recording, or speaker diar...
Performing speaker diarization of a collection of recordings, where speakers are uniquely identified...
We present a clustering-only approach to the problem of speaker diarization to eliminate the need fo...
Speaker diarization is the problem of determining "who spoke when" in an audio recording when the nu...
Abstract Speaker diarization of a collection of recordings with uniquely identified speakers is a ch...
Speaker diarization of a collection of recordings with uniquely identified speakers is a challenging...
This paper investigates the problem of automatically grouping unknown speech utterances based on the...
International audienceAbstract:This paper describes recent advances in speaker diarization with a mu...
Speaker diarization is the process of annotating an input audio with information that attributes tem...
The goal in Speaker Diarization (SD) is to answer the question "Who spoke when?" for a given audio w...
In this paper we propose a novel scheme for carrying out speaker diarization in an iterative manner....
The purpose of this study is to develop robust techniques for speaker segmentation and clustering wi...
International audienceThis paper describes recent advances in speaker diarization by incorporating a...
Forensic audio does not seldom consist of long recordings of multiple speakers engaged in a dialogue...