A variety of methods for audio-visual integration, which integrate audio and visual information at the level of features, states, or classifier outputs, have been proposed for robust speech recognition. However, these methods do not always fully utilize auditory information when the signal-to-noise ratio becomes low. In this paper, we propose a novel approach to estimating speech signals in noisy environments. The key idea behind this approach is to exploit clean speech candidates generated by using timing structures between mouth movements and sound signals. We first extract a pair of feature sequences from the media signals and segment each sequence into temporal intervals. Then, we construct a cross-media timing-structur...
In this paper, we present a method of extracting the time-delay between speech signals collected at ...
This paper is the continuation of previously published work in which we have been analysi...
The performance of automatic speech recognition (ASR) is known to degrade under noise corruption. Su...
In this paper, we propose a novel approach to speaker detection by an integration of audio-visual in...
This paper describes a complete system for audio-visual recognition of continuous speech including r...
In this paper, a noise adaptive speech recognition approach is proposed for recognizing speech which...
We address the problem of robust lip tracking, visual speech feature extraction, and sensor integrat...
The speech signal is inherently characterized by its variations in time, which get reflected as vari...
As a fundamental part of single microphone speech quality enhancement, noise power spectrum estimati...
This paper presents a formant-tracking method for estimation of the time-varying trajectories of a l...
© 1999 Elsevier - This paper introduces a new representation of speech that is invariant...
The paper reviews several techniques which are used in conjunction with the short-term analysis and ...
We propose time-frequency domain methods for noise estimation and speech enhancement. A speech pres...
This paper presents a time-frequency estimator for enhancement of noisy speech in the DFT domain. T...