The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations and other body motion, such as those of the head, convey additional information. We integrate speech cues from many sources and this improves intelligibility, especially when the acoustic signal is degraded. The paper shows how this additional, often complementary, visual speech information can be used for speech recognition. Three methods for parameterizing lip image sequences for recognition using hidden Markov models are compared. Two of these are top-down approaches that fit a model of the inner and outer lip contours and derive lipreading features from a principal component analysis of shape or shape and appearance, respectively. The th...
We describe a robust method for locating and tracking lips in gray-level image sequences. Our approa...
In this paper we propose a new appearance based system which consists of two stages: visual speech f...
By identifying lip movements and characterizing their associations with speech sounds, the performan...
Abstract—The multimodal nature of speech is often ignored in human-computer interaction, but lip def...
This paper describes a new approach for speaker identification based on lipreading. Visual features ...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
A fundamental task in pattern recognition field is to find a suitable representation for a feature. ...
We have designed and implemented a lipreading system which recognizes isolated words using only colo...
We describe a speechreading system that uses both, shape information from the lip contours and inten...
Humans are often able to compensate for noise degradation and uncertainty in speech information by a...
Speech recognition solely based on visual information such as the lip shape and its movement is refe...
Humans are often able to compensate for noise degradation and uncertainty in speech information by a...
Humans are often able to compensate for noise degradation and uncertainty in speech information by a...
Humans are often able to compensate for noise degradation and uncertainty in speech information by a...
We describe a robust method for locating and tracking lips in gray-level image sequences. Our approa...
In this paper we propose a new appearance based system which consists of two stages: visual speech f...
By identifying lip movements and characterizing their associations with speech sounds, the performan...
Abstract—The multimodal nature of speech is often ignored in human-computer interaction, but lip def...
This paper describes a new approach for speaker identification based on lipreading. Visual features ...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
A fundamental task in pattern recognition field is to find a suitable representation for a feature. ...
We have designed and implemented a lipreading system which recognizes isolated words using only colo...
We describe a speechreading system that uses both, shape information from the lip contours and inten...
Humans are often able to compensate for noise degradation and uncertainty in speech information by a...
Speech recognition solely based on visual information such as the lip shape and its movement is refe...
Humans are often able to compensate for noise degradation and uncertainty in speech information by a...
Humans are often able to compensate for noise degradation and uncertainty in speech information by a...
Humans are often able to compensate for noise degradation and uncertainty in speech information by a...
We describe a robust method for locating and tracking lips in gray-level image sequences. Our approa...
In this paper we propose a new appearance based system which consists of two stages: visual speech f...
By identifying lip movements and characterizing their associations with speech sounds, the performan...