Segmenting different individuals in a group meeting and their speech is an important first step for various tasks such as meeting transcription, automatic camera panning, mul-timedia retrieval and monologue detection. In this effort, given a meeting room video, we attempt to segment individ-ual person’s speech and localize them in the video, based on data from a single audio and video source. The segmenta-tion method is driven by audio and enhanced by video cues. We used Bayesian Information Criterion (BIC) to segment the feature vector streams and graph spectral partitioning to cluster them. We compare our results with audio based segmentation method and our localization technique with the commonly used mutual information. 1
The following article presents a novel audio-visual approach for unsupervised speaker localization i...
The paper concentrates on speaker diarization over meeting recordings. The task of speaker diarizati...
Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acou...
The purpose of this paper is to present a system which breaks input speech into segments and identif...
Abstract This chapter presents novel computationally efficient algorithms to extract semantically me...
This dissertation documents the research performed on the topics of localization, diarization and in...
Meetings, common to many business environments, generally involve stationary participants. Thus, par...
This dissertation documents the research performed on the topics of localization, diarization and in...
This dissertation documents the research performed on the topics of localization, diarization and in...
Given a piece of audio recording, the task of speaker diarization can be summarized as answering the...
Multiparty meetings generally involve stationary participants. Participant location information can ...
Given a piece of audio recording, the task of speaker diarization can be summarized as answering the...
Automatic analysis of conversations is important for extracting high-level descriptions of meetings....
Automatic analysis of conversations is important for extracting high-level descriptions of meetings....
This paper investigates the automatic segmentation of meetings into a sequence of group actions or p...
The following article presents a novel audio-visual approach for unsupervised speaker localization i...
The paper concentrates on speaker diarization over meeting recordings. The task of speaker diarizati...
Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acou...
The purpose of this paper is to present a system which breaks input speech into segments and identif...
Abstract This chapter presents novel computationally efficient algorithms to extract semantically me...
This dissertation documents the research performed on the topics of localization, diarization and in...
Meetings, common to many business environments, generally involve stationary participants. Thus, par...
This dissertation documents the research performed on the topics of localization, diarization and in...
This dissertation documents the research performed on the topics of localization, diarization and in...
Given a piece of audio recording, the task of speaker diarization can be summarized as answering the...
Multiparty meetings generally involve stationary participants. Participant location information can ...
Given a piece of audio recording, the task of speaker diarization can be summarized as answering the...
Automatic analysis of conversations is important for extracting high-level descriptions of meetings....
Automatic analysis of conversations is important for extracting high-level descriptions of meetings....
This paper investigates the automatic segmentation of meetings into a sequence of group actions or p...
The following article presents a novel audio-visual approach for unsupervised speaker localization i...
The paper concentrates on speaker diarization over meeting recordings. The task of speaker diarizati...
Audio segmentation, in general, is the task of segmenting a continuous audio stream in terms of acou...