In this paper we investigate the use of noise-robust features characterizing the speech excitation signal as complementary features to the usually considered vocal tract based features for Automatic Speech Recognition (ASR). The proposed Excitation-based Features (EBF) are tested in a state-of-the-art Deep Neural Network (DNN) based hybrid acoustic model for speech recognition. The suggested excitation features expand the set of periodicity features previously considered for ASR, expecting that these features help in a better discrimination of the broad phonetic classes (e.g., fricatives, nasal, vowels, etc.). Our experiments on the AMI meeting transcription system showed that the proposed EBF yield a relative word error rate reduction of a...
Speech recognition is the enabling technology allowing humans to communicate with computers using th...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
Reverberation in speech degrades the performance of speech recognition systems, leading to higher wo...
In this paper we investigate the use of noise-robust features characterizing the speech excitation s...
In this paper we investigate the use of robust to noise features characterizing the speech excitatio...
© 2014 IEEE. We propose a novel exemplar-based feature enhancement method for automatic speech recog...
Analytic phase of the speech signal plays an important role in human speech perception, specially in...
Expressing noisy speech spectra as a linear combination of speech and noise exemplars has been shown...
Recent progress in deep learning has revolutionized speech recognition research, with Deep Neural Ne...
As has been extensively shown, acoustic features for speech recognition can be nurtured from trainin...
Conventional speech recognition systems consist of feature extraction, acoustic and language modelin...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
In this paper we propose two different i-vector representations that improve the noise robustness of...
The paper describes an automatic speech recognition (ASR) system for the 3rd CHiME challenge that ad...
This paper examines the individual and combined impacts of various front-end approaches on the perfo...
Speech recognition is the enabling technology allowing humans to communicate with computers using th...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
Reverberation in speech degrades the performance of speech recognition systems, leading to higher wo...
In this paper we investigate the use of noise-robust features characterizing the speech excitation s...
In this paper we investigate the use of robust to noise features characterizing the speech excitatio...
© 2014 IEEE. We propose a novel exemplar-based feature enhancement method for automatic speech recog...
Analytic phase of the speech signal plays an important role in human speech perception, specially in...
Expressing noisy speech spectra as a linear combination of speech and noise exemplars has been shown...
Recent progress in deep learning has revolutionized speech recognition research, with Deep Neural Ne...
As has been extensively shown, acoustic features for speech recognition can be nurtured from trainin...
Conventional speech recognition systems consist of feature extraction, acoustic and language modelin...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
In this paper we propose two different i-vector representations that improve the noise robustness of...
The paper describes an automatic speech recognition (ASR) system for the 3rd CHiME challenge that ad...
This paper examines the individual and combined impacts of various front-end approaches on the perfo...
Speech recognition is the enabling technology allowing humans to communicate with computers using th...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
Reverberation in speech degrades the performance of speech recognition systems, leading to higher wo...