An improved audio-visual approach is proposed for source separation of physically moving speakers, exploiting multiple video cameras, a circular microphone array and robust spatial beamforming. The challenge in separating moving sources is that the mixing filters are time-varying; the unmixing filters should therefore also be time-varying, but they are difficult to determine from audio measurements alone. The visual modality is therefore used to track the direction of each speaker relative to the microphone array with a Markov chain Monte Carlo particle filter (MCMC-PF). The proposed direction of arrival (DOA) tracker reduces the computational complexity with respect to a previously employed 3-D multi-speaker po...
It is often advantageous to track objects in a scene using multimodal information when such informat...
The "cocktail party problem" has always been a challenging problem to solve and many blind source se...
We propose a multi-modal object tracking algorithm that combines appearance, motion and audio inform...
A novel multimodal solution is proposed to solve the problem of blind source separation (BSS) of mov...
A novel multimodal (audio-visual) approach to the problem of blind source separation (BSS) is evalua...
A novel multimodal (audio-visual) approach to the problem of blind source separation (BSS) is evalua...
A novel multimodal approach is proposed to solve the problem of blind source separation (BSS) of mov...
Blind audio source separation (BASS) is a fascinating problem that has been tackled from many differ...
A novel multimodal approach is proposed to solve the problem of blind source separation (BSS) of mov...
The problem of tracking multiple moving speakers in indoor environments has received much a...
The problem of tracking multiple moving speakers in indoor environments has received much attention....
The problem of tracking multiple moving speakers in indoor environments has recently receiv...
In this thesis, a novel approach is proposed for multi-speaker tracking by integrating audio and vis...
We present a robust and efficient audio-visual (AV) approach to speaker tracking in a room environme...
Audio-visual tracking of multiple speakers requires estimating the state (e.g. velocity and locatio...