The purpose of this study is to develop robust techniques for speaker segmentation and clustering, with a focus on the meetings domain. The techniques examined can, however, be applied to other domains such as telephone conversations and broadcast news. Traditional speaker diarization techniques developed for telephone conversations or broadcast news are based on a single channel, which differs notably from the meetings domain, where multiple channels may be available. When adapted to the meetings domain, these techniques perform worse than expected because they do not exploit direction-of-arrival information, which is available in many meeting rooms equipped with multiple microphones. Moreover, many of these techniques are involved with tunable paramete...
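To make the direction-of-arrival cue concrete, here is a minimal sketch of estimating the time delay of arrival between two microphone channels with GCC-PHAT, a common choice for this kind of feature. It is an illustration under assumptions, not the method of the study above: the function name `gcc_phat_delay`, its parameters, and the synthetic signals are all hypothetical.

```python
import numpy as np

def gcc_phat_delay(sig, ref, fs, max_tau=None):
    """Estimate the delay (in seconds) of `sig` relative to `ref` using GCC-PHAT.

    Hypothetical helper for illustration; a positive result means `sig` lags `ref`.
    """
    n = len(sig) + len(ref)                 # zero-pad to avoid circular wrap-around
    sig_f = np.fft.rfft(sig, n=n)
    ref_f = np.fft.rfft(ref, n=n)
    cross = sig_f * np.conj(ref_f)          # cross-power spectrum
    cross /= np.abs(cross) + 1e-12          # PHAT weighting: keep phase, drop magnitude
    cc = np.fft.irfft(cross, n=n)           # generalized cross-correlation
    max_shift = n // 2
    if max_tau is not None:
        max_shift = max(1, min(int(fs * max_tau), max_shift))
    # Reorder so lags run from -max_shift to +max_shift
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    lag = int(np.argmax(np.abs(cc))) - max_shift
    return lag / fs

# Synthetic usage: channel 2 is channel 1 delayed by 12 samples (assumed values)
fs = 16000
rng = np.random.default_rng(0)
ch1 = rng.standard_normal(4096)
ch2 = np.concatenate((np.zeros(12), ch1[:-12]))
delay_seconds = gcc_phat_delay(ch2, ch1, fs, max_tau=0.002)
```

In a meeting room, one such delay per microphone pair and per analysis frame could serve as an extra feature stream alongside the acoustic features; how the study actually combines the two is not stated in the visible text.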
This survey focuses on two challenging speech processing topics, namely: speaker segmentation and sp...
With the increase in cheap commercially available sensors, recording meetings is becoming an increas...
Speaker diarization is the problem of determining "who spoke when" in an audio recording when the nu...
Given a piece of audio recording, the task of speaker diarization can be summarized as answering the...
In this paper, we investigate new approaches to improve speech activity detection, speaker segmentat...
Speaker clustering refers to a process of classifying a set of input speech data (or spe...
This paper describes recent advances in speaker diarization with a mu...
Submitted for publication. In this paper, we investigate the use of agglomerative Informati...
In any speaker diarization system there is a segmentation phase and a clustering phase. Our system u...
In this paper we describe the ICSI-SRI entry in the Rich Transcription 2005 Spring Meeting...
In this paper, we investigate the use of agglomerative Information Bottleneck (aIB) clustering for t...
In this paper we present the ICSI speaker diarization system submitted for the NIST Rich T...
In this paper we describe the AMIDA speaker diarization system as it was submitted to the NIST Rich...
We investigate using state-of-the-art speaker diarization output for speech recognition pur...