This thesis represents Bayesian joint audio-visual tracking for the 3D locations of multiple people and a current speaker in a real conference environment. To achieve this objective, it focuses on several different research interests, such as acoustic-feature detection, visual-feature detection, a non-linear Bayesian framework, data association, and sensor fusion. As acoustic-feature detection, time-delay-of-arrival~(TDOA) estimation is used for multiple source detection. Localization performance using TDOAs is also analyzed according to different configurations of microphones. As a visual-feature detection, Viola-Jones face detection is used to initialize the locations of unknown multiple objects. Then, a corner feature, based on the resul...
Object tracking in real scenes is an important problem in computer vision due to increasing usage of...
Tracking speakers in multiparty conversations constitutes a fundamental task for automatic meeting a...
Acoustic source (speaker) tracking in the room environment plays an important role in many speech a...
Target tracking is a broad subject area extensively studied in many engineering disciplines. In this...
Visual tracking is a fundamental key to the recognition and analysis of human behaviour. In this th...
Best Paper AwardInternational audienceThis paper proposes a novel audiovisual tracking approach that...
Compact multi-sensor platforms are portable and thus desirable for robotics and personal-assistance ...
International audienceMultiple-speaker tracking is a crucial task for many applications. In real-wor...
The gain in popularity of massive open online courses and other online educational lectures prompts ...
It is often advantageous to track objects in a scene using multimodal information when such informat...
PhD ThesisThis thesis concerns the problem of target localization and tracking in an indoor environm...
This thesis deals with the problem of online visual tracking of multiple humans in an enclosed envir...
Tracking speakers in multi-party conversations represents an important step towards automatic analys...
Thesis (S.M.)--Massachusetts Institute of Technology, School of Architecture and Planning, Program i...
In this paper, we present a novel approach for tracking a lecturer during the course of his speech. ...
Object tracking in real scenes is an important problem in computer vision due to increasing usage of...
Tracking speakers in multiparty conversations constitutes a fundamental task for automatic meeting a...
Acoustic source (speaker) tracking in the room environment plays an important role in many speech a...
Target tracking is a broad subject area extensively studied in many engineering disciplines. In this...
Visual tracking is a fundamental key to the recognition and analysis of human behaviour. In this th...
Best Paper AwardInternational audienceThis paper proposes a novel audiovisual tracking approach that...
Compact multi-sensor platforms are portable and thus desirable for robotics and personal-assistance ...
International audienceMultiple-speaker tracking is a crucial task for many applications. In real-wor...
The gain in popularity of massive open online courses and other online educational lectures prompts ...
It is often advantageous to track objects in a scene using multimodal information when such informat...
PhD ThesisThis thesis concerns the problem of target localization and tracking in an indoor environm...
This thesis deals with the problem of online visual tracking of multiple humans in an enclosed envir...
Tracking speakers in multi-party conversations represents an important step towards automatic analys...
Thesis (S.M.)--Massachusetts Institute of Technology, School of Architecture and Planning, Program i...
In this paper, we present a novel approach for tracking a lecturer during the course of his speech. ...
Object tracking in real scenes is an important problem in computer vision due to increasing usage of...
Tracking speakers in multiparty conversations constitutes a fundamental task for automatic meeting a...
Acoustic source (speaker) tracking in the room environment plays an important role in many speech a...