Noise robustness is one of the primary challenges facing most automatic speech recognition (ASR) systems. A vast amount of research efforts on preventing the degradation of ASR performance under various noisy environments have been made during the past several years. In this paper, we consider the use of histogram equalization (HEQ) for robust ASR. In contrast to conventional methods, a novel data fitting method based on polynomial regression was presented to efficiently approximate the inverse of the cumulative density functions of speech feature vectors for HEQ. Moreover, a more elaborate attempt of using such polynomial regression models to directly characterizing the relationship between the speech feature vectors and their correspondin...
A new front-end normalization algorithm that uses a parametric non-linear transformation is proposed...
Thesis (Master's)--University of Washington, 2020With motivation from histogram matching in image pr...
It is well-known that the performance of automatic speech recognition (ASR) systems are easily affec...
The performance of current automatic speech recognition (ASR) systems radically deteriorates when th...
[[abstract]]Speech is the primary and the most convenient means of communication between individuals...
[[abstract]]The performance of current automatic speech recognition (ASR) systems often deteriorates...
[[abstract]]With the rapid development of intelligent transportation systems (ITS), how to provide u...
Automatic speech recognition (ASR) is a fascinating field of science where the machine almost become...
Mismatch between training and test conditions deteriorates the performance of speech recognizers. Th...
This paper describes an approach to increase the noise robust-ness of automatic speech recognition s...
We propose a new model adaptation method based on the histogram equalization technique for providin...
Feature statistics normalization in the cepstral domain is one of the most performing approaches for...
Feature statistics normalization in the cepstral domain is one of the most performing approaches for...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
We propose a novel technique for robust voiced/unvoiced segment detection in noisy speech, based o...
A new front-end normalization algorithm that uses a parametric non-linear transformation is proposed...
Thesis (Master's)--University of Washington, 2020With motivation from histogram matching in image pr...
It is well-known that the performance of automatic speech recognition (ASR) systems are easily affec...
The performance of current automatic speech recognition (ASR) systems radically deteriorates when th...
[[abstract]]Speech is the primary and the most convenient means of communication between individuals...
[[abstract]]The performance of current automatic speech recognition (ASR) systems often deteriorates...
[[abstract]]With the rapid development of intelligent transportation systems (ITS), how to provide u...
Automatic speech recognition (ASR) is a fascinating field of science where the machine almost become...
Mismatch between training and test conditions deteriorates the performance of speech recognizers. Th...
This paper describes an approach to increase the noise robust-ness of automatic speech recognition s...
We propose a new model adaptation method based on the histogram equalization technique for providin...
Feature statistics normalization in the cepstral domain is one of the most performing approaches for...
Feature statistics normalization in the cepstral domain is one of the most performing approaches for...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
We propose a novel technique for robust voiced/unvoiced segment detection in noisy speech, based o...
A new front-end normalization algorithm that uses a parametric non-linear transformation is proposed...
Thesis (Master's)--University of Washington, 2020With motivation from histogram matching in image pr...
It is well-known that the performance of automatic speech recognition (ASR) systems are easily affec...