Bayesian networks are an attractive modeling tool for human sensing, as they combine an intuitive graphical representation with efficient algorithms for inference and learning. Earlier work has demonstrated that boosted parameter learning could be used to improve the performance of Bayesian network classi£ers for complex multi-modal inference problems such as speaker detection. In speaker detection, the goal is to use video and audio cues to infer when a person is speaking to a user interface. In this paper we introduce a new boosted structure learning algorithm based on AdaBoost. Given labeled data, our algorithm modi£es both the network structure and parameters so as to improve classification accuracy. We compare its performance to both s...
Multiple speaker localization algorithms generally require a bi-nary detector, which performs the so...
A stochastic approach based on Dynamic Bayesian Networks (DBNs) is introduced for spoken language un...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
Design and development of novel human-computer in-terfaces poses a challenging problem: actions and ...
This paper presents a novel approach to automatic speaker recognition using dynamic Bayesian network...
Classification of real-world data poses a number of chal-lenging problems. Mismatch between classifi...
Improving the performance of Automated Speech Recognition system requires incorporating more knowled...
This paper describes the theory and implementation of Bayesian networks in the context of automatic ...
Colloque avec actes et comité de lecture. nationale.National audienceThis paper presents a novel app...
We present a novel probabilistic framework that fuses information coming from the audio and video mo...
The use of Bayesian networks for classification problems has received significant recent attention. ...
International audienceWe investigate the use of structure learning in Bayesian networks for a comple...
The use of visual features in audio-visual speech recognition (AVSR) is justified by both the speec...
We present a speech modeling methodology where no a priori assumption is made on the dependencies be...
Abstract—In audio-visual automatic speech recognition (AVASR) both acoustic and visual modalities of...
Multiple speaker localization algorithms generally require a bi-nary detector, which performs the so...
A stochastic approach based on Dynamic Bayesian Networks (DBNs) is introduced for spoken language un...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
Design and development of novel human-computer in-terfaces poses a challenging problem: actions and ...
This paper presents a novel approach to automatic speaker recognition using dynamic Bayesian network...
Classification of real-world data poses a number of chal-lenging problems. Mismatch between classifi...
Improving the performance of Automated Speech Recognition system requires incorporating more knowled...
This paper describes the theory and implementation of Bayesian networks in the context of automatic ...
Colloque avec actes et comité de lecture. nationale.National audienceThis paper presents a novel app...
We present a novel probabilistic framework that fuses information coming from the audio and video mo...
The use of Bayesian networks for classification problems has received significant recent attention. ...
International audienceWe investigate the use of structure learning in Bayesian networks for a comple...
The use of visual features in audio-visual speech recognition (AVSR) is justified by both the speec...
We present a speech modeling methodology where no a priori assumption is made on the dependencies be...
Abstract—In audio-visual automatic speech recognition (AVASR) both acoustic and visual modalities of...
Multiple speaker localization algorithms generally require a bi-nary detector, which performs the so...
A stochastic approach based on Dynamic Bayesian Networks (DBNs) is introduced for spoken language un...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...