Multimodal interfaces require effective parsing and understanding of utterances whose content is dis-tributed across multiple input modes. Johnston 1998 presents an approach in which strategies for mul-timodal integration are stated declaratively using a unification-based grammar that is used by a multi-dimensional chart parser to compose inputs. This approach is highly expressive and supports a broad class of interfaces, but offers only limited potential for mutual compensation among the input modes, is subject to significant concerns in terms of computa-tional complexity, and complicates selection among alternative multimodal interpretations of the input. In this paper, we present an alternative approach in which multimodal parsing and un...
In recent years, systems have begun to emerge that use speech in the user interface. However, compar...
Bergmann K, Kopp S. Multimodal Content Representation for Speech and Gesture Production. In: Theune ...
Multiple layers of visual (and vocal) signals, plus their different onsets and offsets, represent a ...
Multimodal interfaces enable more natural and effective human-computer interaction by providing mult...
Multimodal systems can represent and manipulate semantics from different human communication modalit...
Natural communication between humans is not limited to speech, but often requires simultaneous coord...
Modern user interfaces can take advantage of multiple input modalities such as speech, gestures, han...
Multi-modal interfaces can achieve more natural and effective human-computer interaction by integrat...
The question addressed in this paper is simple. If the argumentative function of a multimodal narra...
Multimodal conversational interfaces provide a natural means for users to communi-cate with computer...
The use of multiple modes of user input to interact with computers and devices is an active area of ...
www.dfki.de/~wahlster Abstract. We introduce the notion of symmetric multimodality for dialogue syst...
Fink GA, Schillo C, Kummert F, Sagerer G. Incremental Speech Recognition for Multimodal Interfaces. ...
Demonstratives, in particular gestures that “only” accompany speech, are not a big issue in cur-rent...
This paper presents some recent developments at DISTInfoMus Lab on multimodal and cross-modal proces...
In recent years, systems have begun to emerge that use speech in the user interface. However, compar...
Bergmann K, Kopp S. Multimodal Content Representation for Speech and Gesture Production. In: Theune ...
Multiple layers of visual (and vocal) signals, plus their different onsets and offsets, represent a ...
Multimodal interfaces enable more natural and effective human-computer interaction by providing mult...
Multimodal systems can represent and manipulate semantics from different human communication modalit...
Natural communication between humans is not limited to speech, but often requires simultaneous coord...
Modern user interfaces can take advantage of multiple input modalities such as speech, gestures, han...
Multi-modal interfaces can achieve more natural and effective human-computer interaction by integrat...
The question addressed in this paper is simple. If the argumentative function of a multimodal narra...
Multimodal conversational interfaces provide a natural means for users to communi-cate with computer...
The use of multiple modes of user input to interact with computers and devices is an active area of ...
www.dfki.de/~wahlster Abstract. We introduce the notion of symmetric multimodality for dialogue syst...
Fink GA, Schillo C, Kummert F, Sagerer G. Incremental Speech Recognition for Multimodal Interfaces. ...
Demonstratives, in particular gestures that “only” accompany speech, are not a big issue in cur-rent...
This paper presents some recent developments at DISTInfoMus Lab on multimodal and cross-modal proces...
In recent years, systems have begun to emerge that use speech in the user interface. However, compar...
Bergmann K, Kopp S. Multimodal Content Representation for Speech and Gesture Production. In: Theune ...
Multiple layers of visual (and vocal) signals, plus their different onsets and offsets, represent a ...