Extraction of Visual Features for Lipreading

Matthews, I
Cootes, TF
Bangham, JA
Cox, SJ
Harvey, RW

Open link

Publication date

January 2002

DOI

10.1109/34.982900

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Abstract

The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations and other body motion, such as those of the head, convey additional information. We integrate speech cues from many sources and this improves intelligibility, especially when the acoustic signal is degraded. The paper shows how this additional, often complementary, visual speech information can be used for speech recognition. Three methods for parameterizing lip image sequences for recognition using hidden Markov models are compared. Two of these are top-down approaches that fit a model of the inner and outer lip contours and derive lipreading features from a principal component analysis of shape or shape and appearance, respectively. The th...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Extraction of Visual Features for Lipreading

Abstract

Extracted data

Extraction of Visual Features for Lipreading

Abstract

Extracted data

Related items

Related items