Spoken dialog systems have matured to the point where the underlying technologies of speech recognition, natural language understanding, dialog management, natural language generation, and speech synthesis are available for many languages and are portable to various domains. However, when people communicate, spoken language is accompanied by a host of other inputs and outputs. The most immediately salient of these processes is input and output perceived visually: gestures, eye gaze, posture, and the like. We focus on three kinds of visual input: the first can be roughly called visual awareness, and includes factors such as what objects (including people) are in the visual scene, how far away they are from the computer, and so forth. Cer...
This paper focusses on three components of the dialogue system HAM-RPM, which converses in natural l...
Although language usually occurs in an interactive and world-situated context (Clark, 1996), most re...
1 Introduction 2 Vision-Based Input Francis K.H. Quek Vision Interfaces and Systems Laboratory (VIS...
that visual perception can influence and be influenced by concurrent linguistic input has prompted t...
Recent years have witnessed a growing interest in multimodal features of language use, both for theo...
Speech has been used as the foundation for many human/machine interactive systems to convey the user...
The human’s ability to see, listen and speak is naturally embedded in how we interact and communicat...
We investigate the visual and vocal modalities of interaction with computer systems. We focus our at...
When humans converse with each other, they naturally amal-gamate information from multiple modalitie...
Gestures and visible speech cues are often available to listeners to aid their comprehension of the ...
Recent years have witnessed a growing interest in multimodal features of spoken language (Müller et ...
Since listeners usually look at the speaker's face, gestural information has to be absorbed through ...
We present a summary overview of recent work using eye movement data to improve speech technologies....
This paper explores the role of verbal and nonverbal resources for the management of turn taking in ...
Since the pioneering work by Kendon (1967), researchers across disciplines have shown a growing inte...
This paper focusses on three components of the dialogue system HAM-RPM, which converses in natural l...
Although language usually occurs in an interactive and world-situated context (Clark, 1996), most re...
1 Introduction 2 Vision-Based Input Francis K.H. Quek Vision Interfaces and Systems Laboratory (VIS...
that visual perception can influence and be influenced by concurrent linguistic input has prompted t...
Recent years have witnessed a growing interest in multimodal features of language use, both for theo...
Speech has been used as the foundation for many human/machine interactive systems to convey the user...
The human’s ability to see, listen and speak is naturally embedded in how we interact and communicat...
We investigate the visual and vocal modalities of interaction with computer systems. We focus our at...
When humans converse with each other, they naturally amal-gamate information from multiple modalitie...
Gestures and visible speech cues are often available to listeners to aid their comprehension of the ...
Recent years have witnessed a growing interest in multimodal features of spoken language (Müller et ...
Since listeners usually look at the speaker's face, gestural information has to be absorbed through ...
We present a summary overview of recent work using eye movement data to improve speech technologies....
This paper explores the role of verbal and nonverbal resources for the management of turn taking in ...
Since the pioneering work by Kendon (1967), researchers across disciplines have shown a growing inte...
This paper focusses on three components of the dialogue system HAM-RPM, which converses in natural l...
Although language usually occurs in an interactive and world-situated context (Clark, 1996), most re...
1 Introduction 2 Vision-Based Input Francis K.H. Quek Vision Interfaces and Systems Laboratory (VIS...