In this work, normalization techniques in the acoustic feature space are studied which improve the robustness of automatic speech recognition systems. It is shown that there is a fundamental mismatch between training and test data which causes degraded recognition performance. Adaptation and normalization, basic strategies to reduce the mismatch, are introduced and placed into the framework of statistical speech recognition. A classification scheme for different normalization techniques is introduced. Common normalization schemes proposed in the literature are motivated and discussed, and two promising techniques are implemented and studied in detail. Vocal tract length normalization relies on frequency axis warping during signal analysis t...
One of the major challenges for Automatic Speech Recognition is to handle speech variability. Inter-...
This thesis examines techniques to improve the robustness of automatic speech recogni-tion (ASR) sys...
In this paper, we propose a framework for joint normalization of spectral and temporal statistics of...
This thesis addresses the general problem of maintaining robust automatic speech recognition (ASR) p...
One important issue in speech recognition is the ability to handle variations caused by unseen speak...
In this paper, a novel speaker normalization method is presented and compared to a well known vocal ...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
One of the main problems faced by automatic speech recognition is the variability of the testing con...
This paper examines techniques for speaker normalisation and adaptation that are applied in training...
In speech recognition, speaker-dependence of a speech recognition system comes from speaker-dependen...
The perception of syllable prominence depends to a limited extent on the acoustic properties of the ...
Generally speaking, the speaker-dependence of a speech recognition system stems from speaker-depende...
Abstract: The changing on peaks structure of the speech spectrum is perhaps the most important cause...
A proven method for achieving effective automatic speech recognition (ASR) due to speaker difference...
Vocal tract normalization (VTN) is an effective way to reduce inter-speaker variability mainly cause...
One of the major challenges for Automatic Speech Recognition is to handle speech variability. Inter-...
This thesis examines techniques to improve the robustness of automatic speech recogni-tion (ASR) sys...
In this paper, we propose a framework for joint normalization of spectral and temporal statistics of...
This thesis addresses the general problem of maintaining robust automatic speech recognition (ASR) p...
One important issue in speech recognition is the ability to handle variations caused by unseen speak...
In this paper, a novel speaker normalization method is presented and compared to a well known vocal ...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
One of the main problems faced by automatic speech recognition is the variability of the testing con...
This paper examines techniques for speaker normalisation and adaptation that are applied in training...
In speech recognition, speaker-dependence of a speech recognition system comes from speaker-dependen...
The perception of syllable prominence depends to a limited extent on the acoustic properties of the ...
Generally speaking, the speaker-dependence of a speech recognition system stems from speaker-depende...
Abstract: The changing on peaks structure of the speech spectrum is perhaps the most important cause...
A proven method for achieving effective automatic speech recognition (ASR) due to speaker difference...
Vocal tract normalization (VTN) is an effective way to reduce inter-speaker variability mainly cause...
One of the major challenges for Automatic Speech Recognition is to handle speech variability. Inter-...
This thesis examines techniques to improve the robustness of automatic speech recogni-tion (ASR) sys...
In this paper, we propose a framework for joint normalization of spectral and temporal statistics of...