It is often advantageous to track objects in a scene using multimodal information when such information is available. We use audio as a complementary modality to video data, which, in comparison to vision, can provide faster localization over a wider field of view. We present a particle-filter based tracking framework for performing multimodal sensor fusion for tracking people in a videoconferencing environment using multiple cameras and multiple microphone arrays. One advantage of our proposed tracker is its ability to seamlessly handle temporary absence of some measurements (e.g., camera occlusion or silence). Another advantage is the possibility of self-calibration of the joint system to compensate for imprecision in the knowledge of arr...
People tracking has received considerable attention as a research field recently. Since, there are a...
In this thesis, a novel approach is proposed for multi-speaker tracking by integrating audio and vis...
We present a robust and efficient audio-visual (AV) approach to speaker tracking in a room environme...
It is often advantageous to track objects in a scene using multimodal information when such informat...
Abstract—The problem of tracking multiple moving speakers in indoor environments has recently receiv...
The problem of tracking multiple moving speakers in indoor environments has received much attention....
The problem of tracking multiple moving speakers in indoor environments has received much attention....
The problem of tracking multiple moving speakers in indoor environments has received much attention....
Tracking speakers in multi-party conversations represents an important step towards automatic analys...
We propose an audio-visual fusion algorithm for 3D speaker tracking from a localised multi-modal sen...
Abstract—The problem of tracking multiple moving speakers in indoor environments has received much a...
In this paper, we present a novel approach for tracking a lecturer during the course of his speech. ...
We propose a multi-modal object tracking algorithm that combines appearance, motion and audio inform...
Situational awareness is achieved naturally by the human senses of sight and hearing in combination...
We present a robust and efficient audio-visual (AV) approach to speaker tracking in a room environme...
People tracking has received considerable attention as a research field recently. Since, there are a...
In this thesis, a novel approach is proposed for multi-speaker tracking by integrating audio and vis...
We present a robust and efficient audio-visual (AV) approach to speaker tracking in a room environme...
It is often advantageous to track objects in a scene using multimodal information when such informat...
Abstract—The problem of tracking multiple moving speakers in indoor environments has recently receiv...
The problem of tracking multiple moving speakers in indoor environments has received much attention....
The problem of tracking multiple moving speakers in indoor environments has received much attention....
The problem of tracking multiple moving speakers in indoor environments has received much attention....
Tracking speakers in multi-party conversations represents an important step towards automatic analys...
We propose an audio-visual fusion algorithm for 3D speaker tracking from a localised multi-modal sen...
Abstract—The problem of tracking multiple moving speakers in indoor environments has received much a...
In this paper, we present a novel approach for tracking a lecturer during the course of his speech. ...
We propose a multi-modal object tracking algorithm that combines appearance, motion and audio inform...
Situational awareness is achieved naturally by the human senses of sight and hearing in combination...
We present a robust and efficient audio-visual (AV) approach to speaker tracking in a room environme...
People tracking has received considerable attention as a research field recently. Since, there are a...
In this thesis, a novel approach is proposed for multi-speaker tracking by integrating audio and vis...
We present a robust and efficient audio-visual (AV) approach to speaker tracking in a room environme...