Wachsmuth S. Multi-modal scene understanding using probabilistic models. Bielefeld (Germany): Bielefeld University; 2001.This thesis addresses the problem of relating spoken utterances to the simultaneously perceived visual scene context. The development of systems that integrate verbal and visual information is an extending field of research. It is pushed by various applications like the indexing and querying of video databases, service robotics, augmented reality, document analysis, documentation systems with multi-modal interfaces, or other multi-media systems. Each of these applications has to relate two or more different input modalities, which is also known as the correspondence problem. The task of relating realistic inputs like spee...
In order to communicate with their users in a natural and effec-tive manner, humanlike robots must s...
In this paper, we propose a method based on Bayesian networks for interpretation of multimodal signa...
Wachsmuth S, Sagerer G. Integrated Analysis of Speech and Images as a Probabilistic Decoding Process...
Wachsmuth S, Socher G, Brandt-Pook H, Kummert F, Sagerer G. Integration of Vision and Speech Underst...
Wachsmuth S, Sagerer G. Bayesian Networks for Speech and Image Integration. In: Proc. of 18th Natio...
Wachsmuth S, Brandt-Pook H, Socher G, Kummert F, Sagerer G. Multilevel Integration of Vision and Spe...
Wachsmuth S, Fink GA, Kummert F, Sagerer G. Using Speech in Visual Object Recognition. Informatik Ak...
Wachsmuth S, Swadzba A. Probabilistic Scene Modeling for Situated Computer Vision. In: Cohn AG, Hogg...
In this paper, we introduce Bayesian networks architecture for combining speech-based information wi...
Multimodal conversation modeling is an important and challenging problem when building conversationa...
International audienceThanks to their different senses, human observers acquire multiple information...
Today, technology allows us to produce extensive multimodal systems which are totally under human co...
In this paper, we propose a Bayesian network framework for managing interactivity between a tour-gui...
We present a novel probabilistic framework that fuses information coming from the audio and video mo...
160 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1999.Recent advances in various di...
In order to communicate with their users in a natural and effec-tive manner, humanlike robots must s...
In this paper, we propose a method based on Bayesian networks for interpretation of multimodal signa...
Wachsmuth S, Sagerer G. Integrated Analysis of Speech and Images as a Probabilistic Decoding Process...
Wachsmuth S, Socher G, Brandt-Pook H, Kummert F, Sagerer G. Integration of Vision and Speech Underst...
Wachsmuth S, Sagerer G. Bayesian Networks for Speech and Image Integration. In: Proc. of 18th Natio...
Wachsmuth S, Brandt-Pook H, Socher G, Kummert F, Sagerer G. Multilevel Integration of Vision and Spe...
Wachsmuth S, Fink GA, Kummert F, Sagerer G. Using Speech in Visual Object Recognition. Informatik Ak...
Wachsmuth S, Swadzba A. Probabilistic Scene Modeling for Situated Computer Vision. In: Cohn AG, Hogg...
In this paper, we introduce Bayesian networks architecture for combining speech-based information wi...
Multimodal conversation modeling is an important and challenging problem when building conversationa...
International audienceThanks to their different senses, human observers acquire multiple information...
Today, technology allows us to produce extensive multimodal systems which are totally under human co...
In this paper, we propose a Bayesian network framework for managing interactivity between a tour-gui...
We present a novel probabilistic framework that fuses information coming from the audio and video mo...
160 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1999.Recent advances in various di...
In order to communicate with their users in a natural and effec-tive manner, humanlike robots must s...
In this paper, we propose a method based on Bayesian networks for interpretation of multimodal signa...
Wachsmuth S, Sagerer G. Integrated Analysis of Speech and Images as a Probabilistic Decoding Process...