The performance of an automatic speech recognition (ASR) system strongly depends on the representation used for the front-end. If the extracted features do not include all relevant information, the performance of the classification stage is inherently suboptimal. This work is motivated by the fact that humans perform better at speech recognition than machines, particularly for noisy environments. The goal of this thesis is to make use of knowledge of human perception in the selection and optimization of speech features for speech recognition. Papers A and C show that robust feature selection for speech recognition can be based on models of the human auditory system. These papers show that maximizing the similarity of the Euclidian geometry ...
State-of-the-art automatic speech recognition (ASR) systems are significantly inferior to humans esp...
The objective of this study was to provide proof of concept that the speech intelligibility in quiet...
The performance of Mel-frequency cepstrum based automatic speech recognition system significantly de...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
We describe a method to select features for speech recognition that is based on a quantitative model...
Recently, Li et al. proposed a new auditory feature for robust speech recognition in noise environme...
This thesis presents a detailed study on psychoacoustic modeling for feature extraction for robust s...
Recently, Li et al. proposed a new auditory feature for robust speech recognition in noise environme...
This doctoral thesis is the result of a research effort performed in two fields of speech technology...
Using a spectral auditory model along with perturbation based analysis, we develop a new framework t...
A new approach for speech feature extraction in automatic speech recognition (ASR) is proposed in th...
Speech recognition is the enabling technology allowing humans to communicate with computers using th...
The results of investigations into some aspects of robust speech recognition are reported in this th...
While there have been many attempts to mitigate interferences of background noise, the performance o...
Recently, a new auditory-based feature extraction algorithm for robust speech recognition in noisy e...
State-of-the-art automatic speech recognition (ASR) systems are significantly inferior to humans esp...
The objective of this study was to provide proof of concept that the speech intelligibility in quiet...
The performance of Mel-frequency cepstrum based automatic speech recognition system significantly de...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
We describe a method to select features for speech recognition that is based on a quantitative model...
Recently, Li et al. proposed a new auditory feature for robust speech recognition in noise environme...
This thesis presents a detailed study on psychoacoustic modeling for feature extraction for robust s...
Recently, Li et al. proposed a new auditory feature for robust speech recognition in noise environme...
This doctoral thesis is the result of a research effort performed in two fields of speech technology...
Using a spectral auditory model along with perturbation based analysis, we develop a new framework t...
A new approach for speech feature extraction in automatic speech recognition (ASR) is proposed in th...
Speech recognition is the enabling technology allowing humans to communicate with computers using th...
The results of investigations into some aspects of robust speech recognition are reported in this th...
While there have been many attempts to mitigate interferences of background noise, the performance o...
Recently, a new auditory-based feature extraction algorithm for robust speech recognition in noisy e...
State-of-the-art automatic speech recognition (ASR) systems are significantly inferior to humans esp...
The objective of this study was to provide proof of concept that the speech intelligibility in quiet...
The performance of Mel-frequency cepstrum based automatic speech recognition system significantly de...