In this paper we present a sound probabilistic approach to speaker diarization. We use a hybrid framework where a distribution over the number of speakers at each point of a multimodal stream is estimated with a discriminative model. The output of this process is used as input in a generative model that can adapt to a novel test set and perform high accuracy speaker diarization. We manage to deal efficiently with the less common, and therefore harder, segments like silence and multiple speaker parts in a principled probabilistic manner
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
Speaker diarization (SD) involves the detection of speakers within an audio stream and the intervals...
Recently, a fully supervised speaker diarization approach was proposed (UIS-RNN) which models speake...
This chapter aims to present some of the recent Bayesian approaches to speaker diarization (SD). SD ...
Speaker Verification can be treated as a statistical hypothesis testing problem. The most commonly u...
International audienceThis paper proposes a method for segmenting and clustering an audio flow on th...
Abstract—Speaker diarization determines “who spoke when” from the recorded conversations of an unkno...
Inspired by recent success of speaker clustering in Total Variabil-ity space we propose a new probab...
We present a novel probabilistic framework that fuses information coming from the audio and video mo...
The paper describes a novel method that improvises the procedure for supervised speaker diarization....
The goal in Speaker Diarization (SD) is to answer the question "Who spoke when?" for a given audio w...
Speaker diarization systems process audio files by labelling speech segments according to speakers' ...
We consider the problem of speaker diarization, the problem of segmenting an audio recording of a me...
Speaker diarization is the process of annotating an input audio with information that attributes tem...
Forensic audio does not seldom consist of long recordings of multiple speakers engaged in a dialogue...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
Speaker diarization (SD) involves the detection of speakers within an audio stream and the intervals...
Recently, a fully supervised speaker diarization approach was proposed (UIS-RNN) which models speake...
This chapter aims to present some of the recent Bayesian approaches to speaker diarization (SD). SD ...
Speaker Verification can be treated as a statistical hypothesis testing problem. The most commonly u...
International audienceThis paper proposes a method for segmenting and clustering an audio flow on th...
Abstract—Speaker diarization determines “who spoke when” from the recorded conversations of an unkno...
Inspired by recent success of speaker clustering in Total Variabil-ity space we propose a new probab...
We present a novel probabilistic framework that fuses information coming from the audio and video mo...
The paper describes a novel method that improvises the procedure for supervised speaker diarization....
The goal in Speaker Diarization (SD) is to answer the question "Who spoke when?" for a given audio w...
Speaker diarization systems process audio files by labelling speech segments according to speakers' ...
We consider the problem of speaker diarization, the problem of segmenting an audio recording of a me...
Speaker diarization is the process of annotating an input audio with information that attributes tem...
Forensic audio does not seldom consist of long recordings of multiple speakers engaged in a dialogue...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
Speaker diarization (SD) involves the detection of speakers within an audio stream and the intervals...
Recently, a fully supervised speaker diarization approach was proposed (UIS-RNN) which models speake...