Abstract — The Mel-Frequency Cepstral Coefficient (MFCC) or Perceptual Linear Prediction (PLP) feature extraction typically used for automatic speech recognition (ASR) employ several principles which have known counterparts in the cochlea and auditory nerve: frequency decomposition, mel- or bark-warping of the frequency axis, and compression of amplitudes. It seems natural to ask if one can profitably employ a counterpart of the next physiological processing step, synaptic adaptation. We therefore incorporated a simplified model of short-term adaptation into MFCC feature extraction. We evaluated the resulting ASR performance on the AURORA 2 and AURORA 3 tasks, in comparison to ordinary MFCCs, MFCCs processed by RASTA, and MFCCs processed by...
The performance of Mel-frequency cepstrum based automatic speech recognition system significantly de...
A new approach for speech feature extraction in automatic speech recognition (ASR) is proposed in th...
This thesis presents a detailed study on psychoacoustic modeling for feature extraction for robust s...
The human ability to classify acoustic sounds is still unmatched compared to recent methods in machi...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
Speech feature extraction is critical for ASR systems. Such successful features as MFCC and PLP use ...
The results of investigations into some aspects of robust speech recognition are reported in this th...
sitä The effect of bio-inspired spectro-temporal processing for automatic speech recognition (ASR) ...
In this paper, we present advances in the modeling of the masking behavior of the human auditory sys...
Many speech recognition systems use mel-frequency cep-stral coefficient (mfcc) feature extraction as...
Using a spectral auditory model along with perturbation based analysis, we develop a new framework t...
MFCC (Mel-Frequency Cepstral Coefficients) is a kind of traditional speech feature widely used in sp...
This report presents a detail study on psychoacoustic modeling for feature extraction for robust con...
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but t...
A deep study about the computational models of the auditory peripheral system from three different r...
The performance of Mel-frequency cepstrum based automatic speech recognition system significantly de...
A new approach for speech feature extraction in automatic speech recognition (ASR) is proposed in th...
This thesis presents a detailed study on psychoacoustic modeling for feature extraction for robust s...
The human ability to classify acoustic sounds is still unmatched compared to recent methods in machi...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
Speech feature extraction is critical for ASR systems. Such successful features as MFCC and PLP use ...
The results of investigations into some aspects of robust speech recognition are reported in this th...
sitä The effect of bio-inspired spectro-temporal processing for automatic speech recognition (ASR) ...
In this paper, we present advances in the modeling of the masking behavior of the human auditory sys...
Many speech recognition systems use mel-frequency cep-stral coefficient (mfcc) feature extraction as...
Using a spectral auditory model along with perturbation based analysis, we develop a new framework t...
MFCC (Mel-Frequency Cepstral Coefficients) is a kind of traditional speech feature widely used in sp...
This report presents a detail study on psychoacoustic modeling for feature extraction for robust con...
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but t...
A deep study about the computational models of the auditory peripheral system from three different r...
The performance of Mel-frequency cepstrum based automatic speech recognition system significantly de...
A new approach for speech feature extraction in automatic speech recognition (ASR) is proposed in th...
This thesis presents a detailed study on psychoacoustic modeling for feature extraction for robust s...