The addition of visual information derived from the speaker’s lip movements to a speech recogniser (speechread-ing) can significantly enhance the performance of the recog-niser when it is operating under adverse signal-to-noise ra-tios. However, processing of video signals imposes a large computational demand on the system and there is little point in using speechreading techniques if similar performance gains can be obtained using techniques which operate on only the audio signal and which are less computationally ex-pensive. In this paper, we show that combining visual infor-mation with an audio noise compensation technique (spectral subtraction) leads to a performance significantly higher than that obtained using speechreading only or no...
Seeing the talker improves the intelligibility of speech degraded by noise (a visual speech enhancem...
In this paper, we propose a novel spectral subtraction approach for speech enhancement via maximum l...
In this paper we build on our recent work, where we successfully incorporated facial depth data of a...
ABSTRACT The addition of visual information derived from the speaker's lip movements to a speec...
A major goal of current speech recognition research is to improve the robustness of recognition syst...
Visual noise insensitivity is important to audio visual speech recognition (AVSR). Visual noise can ...
We present recent work on improving the performance of automated speech recognizers by using additio...
Speechreading increases intelligibility in human speech perception. This suggests that conventional ...
Speechreading increases intelligibility in human speech perception. This suggests that conventional ...
Visual noise insensitivity is important to audio visual speech recognition (AVSR). Visual noise can ...
It is well known that additive noise can cause a significant decrease in performance for an automati...
OBJECTIVES: The aim of this study was to evaluate the benefit that listeners obtain from visually pr...
In this paper, we propose a novel spectral subtraction approach for speech enhancement via maximum l...
This article describes a modified technique for enhancing noisy speech to improve automatic speech r...
A quantitative measure of relevance is proposed for the task of constructing visual feature sets whi...
Seeing the talker improves the intelligibility of speech degraded by noise (a visual speech enhancem...
In this paper, we propose a novel spectral subtraction approach for speech enhancement via maximum l...
In this paper we build on our recent work, where we successfully incorporated facial depth data of a...
ABSTRACT The addition of visual information derived from the speaker's lip movements to a speec...
A major goal of current speech recognition research is to improve the robustness of recognition syst...
Visual noise insensitivity is important to audio visual speech recognition (AVSR). Visual noise can ...
We present recent work on improving the performance of automated speech recognizers by using additio...
Speechreading increases intelligibility in human speech perception. This suggests that conventional ...
Speechreading increases intelligibility in human speech perception. This suggests that conventional ...
Visual noise insensitivity is important to audio visual speech recognition (AVSR). Visual noise can ...
It is well known that additive noise can cause a significant decrease in performance for an automati...
OBJECTIVES: The aim of this study was to evaluate the benefit that listeners obtain from visually pr...
In this paper, we propose a novel spectral subtraction approach for speech enhancement via maximum l...
This article describes a modified technique for enhancing noisy speech to improve automatic speech r...
A quantitative measure of relevance is proposed for the task of constructing visual feature sets whi...
Seeing the talker improves the intelligibility of speech degraded by noise (a visual speech enhancem...
In this paper, we propose a novel spectral subtraction approach for speech enhancement via maximum l...
In this paper we build on our recent work, where we successfully incorporated facial depth data of a...