With the increase in the computational power of recent computers, audio-visual speech recognition (AVSR) has become an attractive research topic that can lead to a robust solution for speech recognition in noisy environments. In the audio-visual continuous speech recognition system presented in this paper, the audio and visual observation sequences are integrated using a coupled hidden Markov model (CHMM). The statistical properties of the CHMM can describe the asynchrony of the audio and visual features while preserving their natural correlation over time. The experimental results show that the current system, tested on the XM2VTS database, reduces the error rate of the audio-only speech recognition system by over 55% at an SNR of 0 dB.
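To make the coupling idea concrete, the following is a minimal sketch of a forward-likelihood computation for a two-chain coupled HMM, in which each chain's next state depends on the previous states of both chains. It is an illustration only, not the system described in the abstract: the discrete emission tables, the two-state chains, the function name `chmm_forward`, and every toy probability below are assumptions chosen for brevity.

```python
from itertools import product

def chmm_forward(pi_a, pi_v, trans_a, trans_v, emit_a, emit_v, obs_a, obs_v):
    """Joint likelihood P(obs_a, obs_v) under a two-chain coupled HMM.

    trans_a[pa][pv][a] = P(audio state a  | previous audio pa, previous visual pv)
    trans_v[pa][pv][v] = P(visual state v | previous audio pa, previous visual pv)
    emit_a[a][o], emit_v[v][o] are discrete emission tables (an assumption).
    """
    states = list(product(range(len(pi_a)), range(len(pi_v))))
    # Initialisation: the two chains are assumed to start independently.
    alpha = {
        (a, v): pi_a[a] * pi_v[v] * emit_a[a][obs_a[0]] * emit_v[v][obs_v[0]]
        for a, v in states
    }
    for t in range(1, len(obs_a)):
        new_alpha = {}
        for a, v in states:
            # Each chain conditions on BOTH previous states; this is the coupling.
            reach = sum(
                alpha[(pa, pv)] * trans_a[pa][pv][a] * trans_v[pa][pv][v]
                for pa, pv in states
            )
            new_alpha[(a, v)] = reach * emit_a[a][obs_a[t]] * emit_v[v][obs_v[t]]
        alpha = new_alpha
    return sum(alpha.values())

# Hypothetical toy parameters (two states and two symbols per stream).
PI_A = [0.6, 0.4]
PI_V = [0.5, 0.5]
TRANS_A = [[[0.7, 0.3], [0.6, 0.4]], [[0.2, 0.8], [0.1, 0.9]]]
TRANS_V = [[[0.8, 0.2], [0.3, 0.7]], [[0.6, 0.4], [0.2, 0.8]]]
EMIT_A = [[0.9, 0.1], [0.2, 0.8]]
EMIT_V = [[0.7, 0.3], [0.4, 0.6]]

if __name__ == "__main__":
    likelihood = chmm_forward(PI_A, PI_V, TRANS_A, TRANS_V, EMIT_A, EMIT_V,
                              [0, 1, 1], [0, 0, 1])
    print(f"joint likelihood of the toy sequences: {likelihood:.6f}")
```

Because the audio and visual chains keep separate state variables, they can occupy different states at the same time step, which is one way to represent asynchrony between the streams, while the shared conditioning in the transition tables keeps them correlated. A practical recognizer would likely use continuous observation densities and log-domain arithmetic rather than the raw products shown here.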
Abstract—This paper presents the design and evaluation of a speaker-independent audio-visual speech ...
Stochastic signal processing techniques have profoundly changed our perspective on speech processin...
Recent years have seen higher demands for automatic speech recognition (ASR) systems that are able t...
The increase in the number of multimedia applications that require robust speech recognition systems...
We address the problem of robust lip tracking, visual speech feature extraction, and sensor integrat...
This paper describes a complete system for audio-visual recognition of continuous speech including r...
The use of visual features in audio-visual speech recognition (AVSR) is justified by both the speech...
Extending automatic speech recognition (ASR) to the visual modality has been shown to greatly incre...
This work consists of designing a continuous speech recognition system using pattern recognition tec...
The use of visual features in audio-visual speech recognition (AVSR) is justified by both the speec...
This paper presents a novel Hidden Markov Model architecture to model the joint probability of pairs...
This paper presents a novel Hidden Markov Model architecture to model the joint probability of pair...
© 2016 IEEE. Automatic speech recognition (ASR) has become a widespread and convenient mode of human-...
This paper gives an overview of the principles of a system for phoneme based, large vocabulary, cont...
This paper focuses on combining audio-visual signals for Polish speech recognition in conditions of ...