This thesis deals with many aspects of the bimodal speech processing research. It lays out the general framework of visually enhanced speech processing computer systems together with some insight in the human speech bimodal speech perception. There are three main contributions to the field of bimodal speech processing presented in the thesis. Firstly, a novel approach to visual feature extraction suitable for lipreading part of the speech processing system (Lip Geometry Estimation) is presented and described in full detail. Another new, powerful concept which is presented in this thesis is Person Independent Feature Space (PIFS), which is qualitatively analyzed on basis of real-life recorded material. The quantitative improvements obtained ...
This contribution is about the method for automatic lips reading from the video picture. The results...
We study three aspects of designing appearance based visual features for automatic lipreading: (a) T...
This paper describes the audio-visual database collected at AT&T Labs--Research for the study of...
This thesis deals with many aspects of the bimodal speech processing research. It lays out the gener...
This thesis describes how an automatic lip reader was realized. Visual speech recognition is a preco...
In the last two decades we witnessed a rapid increase of the computational power governed by Moore's...
This paper describes the gathering and availability of an audio-visual speech corpus for Dutch langu...
In the last two decades we witnessed a rapid increase of the computational power governed by Moore's...
This paper describes the gathering and availability of an audio-visual speech corpus for Dutch langu...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
Abstract—The multimodal nature of speech is often ignored in human-computer interaction, but lip def...
The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations...
Speech recognition solely based on visual information such as the lip shape and its movement is refe...
This paper describes recent speechreading experiments for a speaker independent continuous digit rec...
This contribution is about the method for automatic lips reading from the video picture. The results...
We study three aspects of designing appearance based visual features for automatic lipreading: (a) T...
This paper describes the audio-visual database collected at AT&T Labs--Research for the study of...
This thesis deals with many aspects of the bimodal speech processing research. It lays out the gener...
This thesis describes how an automatic lip reader was realized. Visual speech recognition is a preco...
In the last two decades we witnessed a rapid increase of the computational power governed by Moore's...
This paper describes the gathering and availability of an audio-visual speech corpus for Dutch langu...
In the last two decades we witnessed a rapid increase of the computational power governed by Moore's...
This paper describes the gathering and availability of an audio-visual speech corpus for Dutch langu...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
Abstract—The multimodal nature of speech is often ignored in human-computer interaction, but lip def...
The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations...
Speech recognition solely based on visual information such as the lip shape and its movement is refe...
This paper describes recent speechreading experiments for a speaker independent continuous digit rec...
This contribution is about the method for automatic lips reading from the video picture. The results...
We study three aspects of designing appearance based visual features for automatic lipreading: (a) T...
This paper describes the audio-visual database collected at AT&T Labs--Research for the study of...