Speaker diarization of a collection of recordings with uniquely identified speakers is a challenging task. A system addressing such a task must account for the inter-session variability present from recording to recording, and it must also scale well to massive amounts of data. In this paper we use a two-stage approach to corpus-wide speaker diarization, consisting of a speaker diarization stage followed by a speaker linking stage. The speaker linking system agglomeratively clusters speaker factor posterior distributions obtained via Joint Factor Analysis, using the Ward method and the Hotelling t-square statistic as the distance measure. We extend this framework to link speakers based on both speech and visual modalities to improve the robustness of the system. The s...
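The linking stage described above can be illustrated with a small sketch. This is not the authors' implementation: it assumes each within-recording speaker is summarized by a mean speaker-factor vector and a diagonal covariance (a simplification of the full posterior), and the names `Cluster`, `t2_distance`, `merge`, `link_speakers` and the stopping `threshold` are all hypothetical. The merge cost is a Hotelling-style T^2 statistic; its n_a*n_b/(n_a+n_b) weighting has the same form as Ward's merge cost.

```python
from dataclasses import dataclass
from itertools import combinations
from typing import List, Sequence, Tuple


@dataclass
class Cluster:
    mean: List[float]          # mean speaker-factor vector
    var: List[float]           # diagonal covariance (simplifying assumption)
    n: int                     # number of within-recording speakers merged so far
    members: Tuple[int, ...]   # indices of the original speakers


def t2_distance(a: Cluster, b: Cluster) -> float:
    """Hotelling-style T^2 statistic using a count-weighted pooled diagonal covariance."""
    total = a.n + b.n
    pooled = [(a.n * va + b.n * vb) / total for va, vb in zip(a.var, b.var)]
    maha = sum((ma - mb) ** 2 / p for ma, mb, p in zip(a.mean, b.mean, pooled))
    return (a.n * b.n / total) * maha


def merge(a: Cluster, b: Cluster) -> Cluster:
    """Combine two clusters; variance follows the law of total variance."""
    total = a.n + b.n
    mean = [(a.n * ma + b.n * mb) / total for ma, mb in zip(a.mean, b.mean)]
    var = [
        (a.n * (va + (ma - m) ** 2) + b.n * (vb + (mb - m) ** 2)) / total
        for va, vb, ma, mb, m in zip(a.var, b.var, a.mean, b.mean, mean)
    ]
    return Cluster(mean, var, total, a.members + b.members)


def link_speakers(clusters: Sequence[Cluster], threshold: float) -> List[Cluster]:
    """Greedily merge the closest pair until the smallest T^2 exceeds the threshold."""
    clusters = list(clusters)
    while len(clusters) > 1:
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: t2_distance(clusters[p[0]], clusters[p[1]]),
        )
        if t2_distance(clusters[i], clusters[j]) > threshold:
            break
        merged = merge(clusters[i], clusters[j])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return clusters


# Toy data: four diarized speakers from different recordings (made-up values).
speakers = [
    Cluster([0.0, 0.0], [0.1, 0.1], 1, (0,)),
    Cluster([0.1, -0.1], [0.1, 0.1], 1, (1,)),
    Cluster([5.0, 5.0], [0.1, 0.1], 1, (2,)),
    Cluster([5.1, 4.9], [0.1, 0.1], 1, (3,)),
]
linked = link_speakers(speakers, threshold=10.0)
print([c.members for c in linked])
```

The exhaustive pair search is quadratic per merge, which is fine for a toy example but would need a more efficient scheme to meet the scalability requirement mentioned in the abstract.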
We present a novel probabilistic framework that fuses information coming from the audio and video mo...
In this paper we extend the concept of speaker annotation within a single-recording, or speaker diar...
Speaker diarization consists of assigning speech signals to people engaged in ...
Speaker diarization of a collection of recordings with uniquely identified speakers is a ch...
Performing speaker diarization of a collection of recordings, where speakers are uniquely identified...
Speaker diarization is the task of determining “who spoke when?” in an audio or video reco...
The purpose of this study is to develop robust techniques for speaker segmentation and clustering wi...
In this paper we present a novel scheme for improving speaker diarization by making use of repeating...
Speaker diarization of meeting recordings is generally based on acoustic information ignoring that m...
Human-machine interaction in meetings requires the localization and identification of the...
Speaker diarization is originally defined as the task of determining “who spoke when” given an aud...
Given a piece of audio recording, the task of speaker diarization can be summarized as answering the...
Nowadays, state-of-the-art speaker diarization and linking systems heavily rel...
In this paper we propose a novel scheme for carrying out speaker diarization in an iterative manner....
Speaker diarization for recordings made in meetings consists of identifying the number of participan...