The problem of tracking multiple moving speakers in indoor environments has received much attention. Earlier techniques were based purely on a single modality, e.g., vision. Recently, the fusion of multi-modal information has been shown to be instrumental in improving tracking performance, as well as robustness in the case of challenging situations like occlusions (by the limited field of view of cameras or by other speakers). However, data fusion algorithms often suffer from noise corrupting the sensor measurements which cause non-negligible detection errors. Here, a novel approach to combining audio and visual data is proposed. We employ the direction of arrival angles of the audio sources to reshape the typical Gaussian noise distributio...
It is often advantageous to track objects in a scene using multimodal information when such informat...
It is often advantageous to track objects in a scene using multimodal information when such informat...
We propose a multi-modal object tracking algorithm that combines appearance, motion and audio inform...
The problem of tracking multiple moving speakers in indoor environments has received much attention....
The problem of tracking multiple moving speakers in indoor environments has received much attention....
Abstract—The problem of tracking multiple moving speakers in indoor environments has received much a...
Abstract—The problem of tracking multiple moving speakers in indoor environments has recently receiv...
We present a robust and efficient audio-visual (AV) approach to speaker tracking in a room environme...
We present a robust and efficient audio-visual (AV) approach to speaker tracking in a room environme...
In this thesis, a novel approach is proposed for multi-speaker tracking by integrating audio and vis...
In this thesis, a novel approach is proposed for multi-speaker tracking by integrating audio and vis...
Particle filtering has emerged as a useful tool for tracking problems. However, the efficiency and a...
Particle filtering has emerged as a useful tool for tracking problems. However, the efficiency and a...
We propose an audio-visual fusion algorithm for 3D speaker tracking from a localised multi-modal sen...
Particle filtering has emerged as a useful tool for track-ing problems. However, the efficiency and ...
It is often advantageous to track objects in a scene using multimodal information when such informat...
It is often advantageous to track objects in a scene using multimodal information when such informat...
We propose a multi-modal object tracking algorithm that combines appearance, motion and audio inform...
The problem of tracking multiple moving speakers in indoor environments has received much attention....
The problem of tracking multiple moving speakers in indoor environments has received much attention....
Abstract—The problem of tracking multiple moving speakers in indoor environments has received much a...
Abstract—The problem of tracking multiple moving speakers in indoor environments has recently receiv...
We present a robust and efficient audio-visual (AV) approach to speaker tracking in a room environme...
We present a robust and efficient audio-visual (AV) approach to speaker tracking in a room environme...
In this thesis, a novel approach is proposed for multi-speaker tracking by integrating audio and vis...
In this thesis, a novel approach is proposed for multi-speaker tracking by integrating audio and vis...
Particle filtering has emerged as a useful tool for tracking problems. However, the efficiency and a...
Particle filtering has emerged as a useful tool for tracking problems. However, the efficiency and a...
We propose an audio-visual fusion algorithm for 3D speaker tracking from a localised multi-modal sen...
Particle filtering has emerged as a useful tool for track-ing problems. However, the efficiency and ...
It is often advantageous to track objects in a scene using multimodal information when such informat...
It is often advantageous to track objects in a scene using multimodal information when such informat...
We propose a multi-modal object tracking algorithm that combines appearance, motion and audio inform...