In this paper we propose a new appearance based system which consists of two stages: visual speech feature extraction and classification, followed by recognition of the extracted feature, thereby the result is a complete lip-reading system. This lip-reading system employs our Hyper Column Model (HCM) approach to extract and classify the visual features and uses the Hidden Markov Model (HMM) for recognition. This paper addresses mainly the first stage; i.e. feature extraction and classification. We investigate the HCM performance to achieve feature extraction and classification and then compare the performance when replacing HCM with Fast Discrete Cosine Transform (FDCT). Unlike FDCT, HCM could extract the entire features without any loss. A...
This chapter focuses on a brief introduction on the origins of the audio-visual speech recognition p...
The performance of automatic speech recognition (ASR) system can be significantly enhanced with addi...
This paper describes a feature-fusion audio-visual speech recognition (AVSR) system that extracts li...
A fundamental task in pattern recognition field is to find a suitable representation for a feature. ...
The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations...
Abstract—The multimodal nature of speech is often ignored in human-computer interaction, but lip def...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
Abstract. In this paper an evaluation of visual speech features is performed specifically for the ta...
This thesis describes how an automatic lip reader was realized. Visual speech recognition is a preco...
Visual Speech Recognition (VSR) deals with the task of extracting speech information from visual cue...
Visual Speech Recognition (VSR) deals with the task of extracting speech information from visual cue...
This paper presents the development of a novel visual speech recognition (VSR) system based on a new...
This paper presents the development of a novel visual speech recognition (VSR) system based on a new...
This paper presents the development of a novel visual speech recognition (VSR) system based on a new...
This chapter focuses on a brief introduction on the origins of the audio-visual speech recognition p...
The performance of automatic speech recognition (ASR) system can be significantly enhanced with addi...
This paper describes a feature-fusion audio-visual speech recognition (AVSR) system that extracts li...
A fundamental task in pattern recognition field is to find a suitable representation for a feature. ...
The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations...
Abstract—The multimodal nature of speech is often ignored in human-computer interaction, but lip def...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
Abstract. In this paper an evaluation of visual speech features is performed specifically for the ta...
This thesis describes how an automatic lip reader was realized. Visual speech recognition is a preco...
Visual Speech Recognition (VSR) deals with the task of extracting speech information from visual cue...
Visual Speech Recognition (VSR) deals with the task of extracting speech information from visual cue...
This paper presents the development of a novel visual speech recognition (VSR) system based on a new...
This paper presents the development of a novel visual speech recognition (VSR) system based on a new...
This paper presents the development of a novel visual speech recognition (VSR) system based on a new...
This chapter focuses on a brief introduction on the origins of the audio-visual speech recognition p...
The performance of automatic speech recognition (ASR) system can be significantly enhanced with addi...
This paper describes a feature-fusion audio-visual speech recognition (AVSR) system that extracts li...