Using a spectral auditory model along with perturbation based analysis, we develop a new framework to optimize a set of fea-tures such that it emulates the behavior of the human auditory sys-tem. The optimization is carried out in an off-line manner based on the conjecture that the local geometries of the feature domain and the perceptual auditory domain should be similar. Using this principle, we modify and optimize the static mel frequency cep-stral coefficients (MFCCs) without considering any feedback from the speech recognition system. We show that improved recognition performance is obtained for any environmental condition, clean as well as noisy. Index Terms: MFCC, auditory model, ASR. 1
The human ability to classify acoustic sounds is still unmatched compared to recent methods in machi...
A new approach for speech feature extraction in automatic speech recognition (ASR) is proposed in th...
This thesis presents a detailed study on psychoacoustic modeling for feature extraction for robust s...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
In this paper we study the noise-robustness of mel-frequency cep-stral coefficients (MFCCs) and expl...
Performance of an automatic speech recognition system drops dramatically in the presence of backgrou...
Performance of an automatic speech recognition system drops dramatically in the presence of backgrou...
MFCC (Mel-Frequency Cepstral Coefficients) is a kind of traditional speech feature widely used in sp...
While there have been many attempts to mitigate interferences of background noise, the performance o...
Many speech recognition systems use mel-frequency cep-stral coefficient (mfcc) feature extraction as...
This paper examines the effect of applying noise compensation to acoustic speech feature prediction ...
This paper proposes a technique of extracting robust feature vectors for ASR. The technique is inspi...
Abstract — The Mel-Frequency Cepstral Coefficient (MFCC) or Perceptual Linear Prediction (PLP) featu...
Speech recognition is the enabling technology allowing humans to communicate with computers using th...
The human ability to classify acoustic sounds is still unmatched compared to recent methods in machi...
A new approach for speech feature extraction in automatic speech recognition (ASR) is proposed in th...
This thesis presents a detailed study on psychoacoustic modeling for feature extraction for robust s...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
In this paper we study the noise-robustness of mel-frequency cep-stral coefficients (MFCCs) and expl...
Performance of an automatic speech recognition system drops dramatically in the presence of backgrou...
Performance of an automatic speech recognition system drops dramatically in the presence of backgrou...
MFCC (Mel-Frequency Cepstral Coefficients) is a kind of traditional speech feature widely used in sp...
While there have been many attempts to mitigate interferences of background noise, the performance o...
Many speech recognition systems use mel-frequency cep-stral coefficient (mfcc) feature extraction as...
This paper examines the effect of applying noise compensation to acoustic speech feature prediction ...
This paper proposes a technique of extracting robust feature vectors for ASR. The technique is inspi...
Abstract — The Mel-Frequency Cepstral Coefficient (MFCC) or Perceptual Linear Prediction (PLP) featu...
Speech recognition is the enabling technology allowing humans to communicate with computers using th...
The human ability to classify acoustic sounds is still unmatched compared to recent methods in machi...
A new approach for speech feature extraction in automatic speech recognition (ASR) is proposed in th...
This thesis presents a detailed study on psychoacoustic modeling for feature extraction for robust s...