Two approaches for modulation spectrum equalization are proposed for robust feature extraction in speech recognition. In both cases the temporal trajectories of the feature parameters are first transformed into the modulation spectrum. In the spectral histogram equalization (SHE) approach, we equalize the histogram of the modulation spectrum for each utterance to a reference histogram obtained from clean training data. In the magnitude ratio equalization (MRE) approach, we equalize the magnitude ratio of lower to higher frequency components on the modulation spectrum to a reference value also obtained from clean training data. Preliminary experimental results performed on the AURORA 2 testing environment indicate that significant performanc...
The full modulation spectrum is a high-dimensional representation of one-dimensional audio signals. ...
The use of a speech recognition system with telephone channel environments, or different microphones...
In any real environment, noises degrade the performance of Automatic Speech Recognition (ASR) system...
[[abstract]]In this paper, we present two novel algorithms to improve the noise robustness of featur...
[[abstract]]In this article, we present an effective compensation scheme to improve noise robustness...
Mismatch between training and test conditions deteriorates the performance of speech recognizers. Th...
This paper presents an improved version of a spectral normalisation based method for extraction of s...
We present a new feature representation for speech recognition based on both amplitude modulation sp...
This paper presents a method for extracting MFCC parameters from a normalised power spectrum density...
This paper presents a novel noise-robust feature extraction method for speech recognition using the ...
This paper presents a spectral normalisation based method for extraction of speech robust features i...
The goal of the work described in this paper is to develop and evaluate procedures for automatic est...
Feature statistics normalization in the cepstral domain is one of the most performing approaches for...
In this paper, we present robust feature extractors that incorporate a regularized minimum variance ...
The paper tackles the problem of noisy speech recognition. In particular, we present a novel approac...
The full modulation spectrum is a high-dimensional representation of one-dimensional audio signals. ...
The use of a speech recognition system with telephone channel environments, or different microphones...
In any real environment, noises degrade the performance of Automatic Speech Recognition (ASR) system...
[[abstract]]In this paper, we present two novel algorithms to improve the noise robustness of featur...
[[abstract]]In this article, we present an effective compensation scheme to improve noise robustness...
Mismatch between training and test conditions deteriorates the performance of speech recognizers. Th...
This paper presents an improved version of a spectral normalisation based method for extraction of s...
We present a new feature representation for speech recognition based on both amplitude modulation sp...
This paper presents a method for extracting MFCC parameters from a normalised power spectrum density...
This paper presents a novel noise-robust feature extraction method for speech recognition using the ...
This paper presents a spectral normalisation based method for extraction of speech robust features i...
The goal of the work described in this paper is to develop and evaluate procedures for automatic est...
Feature statistics normalization in the cepstral domain is one of the most performing approaches for...
In this paper, we present robust feature extractors that incorporate a regularized minimum variance ...
The paper tackles the problem of noisy speech recognition. In particular, we present a novel approac...
The full modulation spectrum is a high-dimensional representation of one-dimensional audio signals. ...
The use of a speech recognition system with telephone channel environments, or different microphones...
In any real environment, noises degrade the performance of Automatic Speech Recognition (ASR) system...