This paper describes a complete system for audio-visual recognition of continuous speech including robust lip tracking, visual feature extraction, noise-robust acoustic feature extraction, and sensor integration. An appearance based model of the articulators, which represents linguistically important features, is learned from example images and is used to locate, track, and recover visual speech information. We tackle the problem of joint temporal modeling of the acoustic and visual speech signal by applying multi-stream hidden Markov models. This approach allows the definition of different temporal topologies and levels of stream integration and hence enables to model temporal dependencies more accurately than traditional approaches. We pr...
This paper focuses on combining audio-visual signals for Polish speech recognition in conditions of ...
This paper describes recent speechreading experiments for a speaker independent continuous digit rec...
This chapter focuses on a brief introduction on the origins of the audio-visual speech recognition p...
We address the problem of robust lip tracking, visual speech feature extraction, and sensor integrat...
The increase in the number of multimedia applications that require robust speech recognition systems...
With the increase in the computational complexity of recent computers, audio-visual speech recogniti...
Abstract—This paper presents the design and evaluation of a speaker-independent audio-visual speech ...
The Multi-Stream automatic speech recognition approach was investigated in this work as a framework ...
Abstract — Visual speech information from the speaker’s mouth region has been successfully shown to ...
In this paper, we present a new approach to visual speech recognition which improves contextual mode...
Speech recognition can be improved by using visual information in the form of lip movements of the s...
This work consists on designing a continuous speech recognition system using pattern recognition tec...
This paper gives an overview of the principles of a system for phoneme based, large vocabulary, cont...
Speechreading increases intelligibility in human speech perception. This suggests that conventional ...
Recent improvements are presented for phonetic decoding of continuous-speech from ultrasound and opt...
This paper focuses on combining audio-visual signals for Polish speech recognition in conditions of ...
This paper describes recent speechreading experiments for a speaker independent continuous digit rec...
This chapter focuses on a brief introduction on the origins of the audio-visual speech recognition p...
We address the problem of robust lip tracking, visual speech feature extraction, and sensor integrat...
The increase in the number of multimedia applications that require robust speech recognition systems...
With the increase in the computational complexity of recent computers, audio-visual speech recogniti...
Abstract—This paper presents the design and evaluation of a speaker-independent audio-visual speech ...
The Multi-Stream automatic speech recognition approach was investigated in this work as a framework ...
Abstract — Visual speech information from the speaker’s mouth region has been successfully shown to ...
In this paper, we present a new approach to visual speech recognition which improves contextual mode...
Speech recognition can be improved by using visual information in the form of lip movements of the s...
This work consists on designing a continuous speech recognition system using pattern recognition tec...
This paper gives an overview of the principles of a system for phoneme based, large vocabulary, cont...
Speechreading increases intelligibility in human speech perception. This suggests that conventional ...
Recent improvements are presented for phonetic decoding of continuous-speech from ultrasound and opt...
This paper focuses on combining audio-visual signals for Polish speech recognition in conditions of ...
This paper describes recent speechreading experiments for a speaker independent continuous digit rec...
This chapter focuses on a brief introduction on the origins of the audio-visual speech recognition p...