This paper investigates a computational model that combines a frontend based on an auditory model with an exemplar-based sparse coding procedure for estimating the posterior probabilities of sub-word units when processing noisified speech. Envelope modulation spectrogram (EMS) features are extracted using an auditory model which decomposes the envelopes of the outputs of a bank of gammatone filters into one lowpass and multiple bandpass components. Through a systematic analysis of the configuration of the modulation filterbank, we investigate how and why different configurations affect the posterior probabilities of sub-word units by measuring the recognition accuracy on a semantics-free speech recognition task. Our main finding is that rep...
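The decomposition of a band envelope into one lowpass and several bandpass modulation components can be illustrated with a minimal sketch. It is not the auditory model used in the paper: the gammatone stage is skipped, the input is a synthetic envelope with a 4 Hz modulation, and the filter types (a one-pole lowpass and a standard second-order bandpass), the 1 Hz cutoff, the centre frequencies, the Q value, and all function names are assumptions chosen for illustration only.

```python
import math


def one_pole_lowpass(x, cutoff_hz, fs):
    """First-order recursive lowpass: y[n] = (1 - a) * x[n] + a * y[n - 1]."""
    a = math.exp(-2.0 * math.pi * cutoff_hz / fs)
    out, prev = [], 0.0
    for v in x:
        prev = (1.0 - a) * v + a * prev
        out.append(prev)
    return out


def biquad_bandpass(x, center_hz, fs, q=1.0):
    """Second-order bandpass with unit gain at center_hz (q is an assumed value)."""
    w0 = 2.0 * math.pi * center_hz / fs
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b0, b2 = alpha / a0, -alpha / a0  # the x[n-1] coefficient is zero
    a1, a2 = -2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0
    out = []
    x1 = x2 = y1 = y2 = 0.0
    for v in x:
        y = b0 * v + b2 * x2 - a1 * y1 - a2 * y2
        x2, x1 = x1, v
        y2, y1 = y1, y
        out.append(y)
    return out


def modulation_decompose(envelope, fs, lowpass_hz=1.0,
                         centers_hz=(2.0, 4.0, 8.0, 16.0)):
    """Split one envelope into a lowpass band and several bandpass bands.

    The cutoff and centre frequencies are illustrative assumptions,
    not the configuration studied in the paper.
    """
    bands = {"lowpass": one_pole_lowpass(envelope, lowpass_hz, fs)}
    for fc in centers_hz:
        bands[fc] = biquad_bandpass(envelope, fc, fs)
    return bands


def rms(x):
    """Root-mean-square level of a sequence."""
    return math.sqrt(sum(v * v for v in x) / len(x))


# Synthetic envelope: constant level 1.0 plus a 4 Hz modulation of depth 0.5.
fs = 100.0  # envelope sampling rate in Hz (assumed)
envelope = [1.0 + 0.5 * math.sin(2.0 * math.pi * 4.0 * n / fs)
            for n in range(400)]

bands = modulation_decompose(envelope, fs)
# Discard the first two seconds so that filter transients have died out.
steady = {name: band[200:] for name, band in bands.items()}
for name, band in steady.items():
    print(name, round(rms(band), 3))
```

In this toy setting the 4 Hz band carries most of the modulation energy, while the lowpass band tracks the slowly varying level of the envelope. Stacking such per-band outputs across all gammatone channels would give one possible layout for a modulation spectrogram feature.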
Automatic speech recognition (ASR) is a fascinating field of science where the machine almost become...
We propose a novel framework for noise robust automatic speech recognition (ASR) based on cochlear i...
One of the biggest obstacles that hinder the widespread use of automatic speech recognition technolo...
In this paper, we analyze the temporal modulation characteristics of speech and noise from a speech...
The full modulation spectrum is a high-dimensional representation of one-dimensional audio signals. ...
Feature computation models for automatic speech recognition (ASR) systems have long been modeled on ...
The human ability to classify acoustic sounds is still unmatched compared to recent methods in machi...
The performance of Mel-frequency cepstrum based automatic speech recognition systems significantly de...
While there have been many attempts to mitigate interferences of background noise, the performance o...
© 2014 IEEE. We propose a novel exemplar-based feature enhancement method for automatic speech recog...
A new approach to automatic speech recognition based on independent class-conditional probability es...
Expressing noisy speech spectra as a linear combination of speech and noise exemplars has been shown...
Speech recognition is the enabling technology allowing humans to communicate with computers using th...
In this paper we develop different mathematical models in the framework of the multi-stream paradigm...