Natural communication between humans is not limited to speech, but often requires simultaneous coordination of multiple streams of information---especially hand gestures---to complement or supplement understanding. This thesis describes a software architecture, called CLAVIUS Whose purpose is to generically interpret multiple modes of input as singular semantic utterances through a modular programming interface that supports various sensing technologies. This interpretation is accomplished through a new multi-threaded parsing algorithm that co-ordinates top-down and bottom-up methods asynchronously on graph-based unification grammars. The interpretation process follows a best-first approach where partial parses are evaluated by a combinatio...
Thesis (Ph.D.)--University of Washington, 2015Robust language understanding systems have the potenti...
In recent years, the interest in research in speech understanding and spoken interaction has soared ...
International audienceEfficient processing of speech and language is required at all levels in the d...
Multimodal interfaces enable more natural and effective human-computer interaction by providing mult...
Multimodal interfaces require effective parsing and understanding of utterances whose content is dis...
Human communication is naturally multimodal. People normally interact through several communication ...
Multimodal systems can represent and manipulate semantics from different human communication modalit...
In this article we describe the collection and analysis of multilingual dialogs with a human or mach...
Modern user interfaces can take advantage of multiple input modalities such as speech, gestures, han...
Thanks to recent scientific advances, it is now possible to design multimodal interfaces allowing th...
What kinds of grammar or code are used in interactive communications with speech and gestures? How v...
All language use is a multimodal endeavor. Language processing must deal with both auditory and visu...
This paper focuses on the modelling of the linguistic level of MICRO, a multi-agents speech understa...
In this paper, I will argue that although the study of multimodal interaction offers exciting new pr...
As spoken language interfaces for real-world systems become a practical possibility, it has become a...
Thesis (Ph.D.)--University of Washington, 2015Robust language understanding systems have the potenti...
In recent years, the interest in research in speech understanding and spoken interaction has soared ...
International audienceEfficient processing of speech and language is required at all levels in the d...
Multimodal interfaces enable more natural and effective human-computer interaction by providing mult...
Multimodal interfaces require effective parsing and understanding of utterances whose content is dis...
Human communication is naturally multimodal. People normally interact through several communication ...
Multimodal systems can represent and manipulate semantics from different human communication modalit...
In this article we describe the collection and analysis of multilingual dialogs with a human or mach...
Modern user interfaces can take advantage of multiple input modalities such as speech, gestures, han...
Thanks to recent scientific advances, it is now possible to design multimodal interfaces allowing th...
What kinds of grammar or code are used in interactive communications with speech and gestures? How v...
All language use is a multimodal endeavor. Language processing must deal with both auditory and visu...
This paper focuses on the modelling of the linguistic level of MICRO, a multi-agents speech understa...
In this paper, I will argue that although the study of multimodal interaction offers exciting new pr...
As spoken language interfaces for real-world systems become a practical possibility, it has become a...
Thesis (Ph.D.)--University of Washington, 2015Robust language understanding systems have the potenti...
In recent years, the interest in research in speech understanding and spoken interaction has soared ...
International audienceEfficient processing of speech and language is required at all levels in the d...