Speech is the most important means of interaction among human beings. This has inspired researchers to study speech recognition further and to develop computer systems that can interpret and understand human speech. However, acoustically noisy environments can heavily contaminate the audio signal and degrade overall recognition performance. Audio-Visual Speech Recognition (AVSR) is designed to overcome this problem by utilising visual images, which are unaffected by acoustic noise. The aim of this paper is to discuss the structure of AVSR systems, including the front-end processes, the audio-visual data corpora used, recent works, and accuracy estimation methods.
Audio-visual automatic speech recognition (AVASR) is a speech recognition technique integrating audi...
NTCD-TIMIT: A New Database and Baseline for Noise-robust Audio-visual Speech Recognition. Although a...
Audio-based automatic speech recognition (ASR) degrades significantly in noisy environments and is p...
Despite significant advances in the area of Automatic Speech Recognition (ASR), systems still resul...
This paper implements and compares the performance of a number of techniques proposed for improving ...
This paper examines the utility of audio-visual speech for the two related tasks of speech and speak...
This paper describes audio-visual speech recognition system for Polish language and a set of perform...
The use of visual features in the form of lip movements to improve the performance of acoustic speec...
Visual speech information from the speaker’s mouth region has been successfully shown to ...
In this thesis, a number of important issues relating to the use of both audio and video information...
Automatic speech recognition (ASR) holds the promise of providing a natural, efficient, and safer me...
Automatic speech recognition (ASR) permits effective interaction between humans and machines in envi...
We compare automatic recognition with human perception of audio-visual speech, in the large-vocabula...
Automatic speech recognition (ASR) performs well under restricted conditions, but performance degrad...
Education is a fundamental right that enriches everyone’s life. However, physically challenged peopl...