Thesis (Master's)--University of Washington, 2020With motivation from histogram matching in image processing used to redistribute pixel probabilities in each color channel of an image, a new approach with an old technique is used for reducing acoustic mismatch between audio signals. Mel-frequency-dependent histogram matching with a silence threshold used in the log Mel-spectrogram domain is implemented before the decoding step in an automatic speech recognition system. The technique is shown to be effective within a system built to recognize low-resource, noisy, compressed, and distorted air traffic control communications. The algorithm has been shown to be robust to high acoustic variance and capable of reducing acoustic mismatch between t...
Many new consumer applications are based on the use of automatic speech recognition (ASR) systems, s...
International audienceMany audio processing algorithms have optimal performance for specific signal ...
The paper describes a system for automatic speech recognition (ASR) that is benchmarked with data of...
Automatic speech recognition (ASR) is a fascinating field of science where the machine almost become...
Mismatch between training and test conditions deteriorates the performance of speech recognizers. Th...
Feature statistics normalization in the cepstral domain is one of the most performing approaches for...
This paper describes an approach to increase the noise robust-ness of automatic speech recognition s...
Feature statistics normalization in the cepstral domain is one of the most performing approaches for...
International audienceThis article introduces automatic speech recognition based on Electro-Magnetic...
We propose a new model adaptation method based on the histogram equalization technique for providin...
The performance of current automatic speech recognition (ASR) systems radically deteriorates when th...
Developing automatic speech recognition systems that are robust to mismatched and noisy channel cond...
Noise robustness is one of the primary challenges facing most automatic speech recognition (ASR) sys...
[[abstract]]Speech is the primary and the most convenient means of communication between individuals...
The acoustic environment in which speech is recorded has a strong influence on the statistical distr...
Many new consumer applications are based on the use of automatic speech recognition (ASR) systems, s...
International audienceMany audio processing algorithms have optimal performance for specific signal ...
The paper describes a system for automatic speech recognition (ASR) that is benchmarked with data of...
Automatic speech recognition (ASR) is a fascinating field of science where the machine almost become...
Mismatch between training and test conditions deteriorates the performance of speech recognizers. Th...
Feature statistics normalization in the cepstral domain is one of the most performing approaches for...
This paper describes an approach to increase the noise robust-ness of automatic speech recognition s...
Feature statistics normalization in the cepstral domain is one of the most performing approaches for...
International audienceThis article introduces automatic speech recognition based on Electro-Magnetic...
We propose a new model adaptation method based on the histogram equalization technique for providin...
The performance of current automatic speech recognition (ASR) systems radically deteriorates when th...
Developing automatic speech recognition systems that are robust to mismatched and noisy channel cond...
Noise robustness is one of the primary challenges facing most automatic speech recognition (ASR) sys...
[[abstract]]Speech is the primary and the most convenient means of communication between individuals...
The acoustic environment in which speech is recorded has a strong influence on the statistical distr...
Many new consumer applications are based on the use of automatic speech recognition (ASR) systems, s...
International audienceMany audio processing algorithms have optimal performance for specific signal ...
The paper describes a system for automatic speech recognition (ASR) that is benchmarked with data of...