Traditional speech enhancement methods optimise signal-level criteria such as signal-to-noise ratio, but these approaches are sub-optimal for noise-robust speech recognition. Likelihood-maximising (LIMA) frameworks are an alternative that optimise parameters of enhancement algorithms based on state sequences generated for utterances with known transcriptions. Previous reports of LIMA frameworks have shown significant promise for improving speech recognition accuracies under additive background noise for a range of speech enhancement techniques. In this paper we discuss the drawbacks of the LIMA approach when multiple layers of acoustic mismatch are present – namely background noise and speaker accent. Experimentation using LIMA-based Mel-fi...
Information theoretical concepts have been used in the analysis of human hearing and for the definit...
This thesis describes research into effective voice biometrics (speaker recognition) under mismatche...
Performance of automatic speech recognition relies on a vast amount of training speech data mostly r...
Traditional speech enhancement methods optimise signal-level criteria such as signal-to-noise ratio,...
Speech recognition in car environments has been identified as a valuable means for reducing driver d...
Automatic Speech Recognition (ASR) has matured into a technology which is becoming more common in ou...
Abstract Speech recognition in car environments has been identified as a valuable means for reducing...
Speech recognition in car environments has been identified as a valuable means for reducing driver d...
It is well known that additive noise can cause a significant decrease in performance for an automati...
Much research has been focused on the problem of achieving automatic speech recognition (ASR) which ...
In noisy environments, speech recognition accuracy\ud degrades significantly. Speech enhancement alg...
The usage of deep learning algorithms has resulted in significant progress in auto- matic speech rec...
AbstractThe maximum a posteriori (MAP) criterion is popularly used for feature compensation (FC) and...
Wang Y., Vuerinckx R., Gemmeke J., Cranen B., Van hamme H., ''Evaluation of missing data techniques ...
Abstract — In noisy environments, speech recognition accuracy degrades significantly. Speech enhance...
Information theoretical concepts have been used in the analysis of human hearing and for the definit...
This thesis describes research into effective voice biometrics (speaker recognition) under mismatche...
Performance of automatic speech recognition relies on a vast amount of training speech data mostly r...
Traditional speech enhancement methods optimise signal-level criteria such as signal-to-noise ratio,...
Speech recognition in car environments has been identified as a valuable means for reducing driver d...
Automatic Speech Recognition (ASR) has matured into a technology which is becoming more common in ou...
Abstract Speech recognition in car environments has been identified as a valuable means for reducing...
Speech recognition in car environments has been identified as a valuable means for reducing driver d...
It is well known that additive noise can cause a significant decrease in performance for an automati...
Much research has been focused on the problem of achieving automatic speech recognition (ASR) which ...
In noisy environments, speech recognition accuracy\ud degrades significantly. Speech enhancement alg...
The usage of deep learning algorithms has resulted in significant progress in auto- matic speech rec...
AbstractThe maximum a posteriori (MAP) criterion is popularly used for feature compensation (FC) and...
Wang Y., Vuerinckx R., Gemmeke J., Cranen B., Van hamme H., ''Evaluation of missing data techniques ...
Abstract — In noisy environments, speech recognition accuracy degrades significantly. Speech enhance...
Information theoretical concepts have been used in the analysis of human hearing and for the definit...
This thesis describes research into effective voice biometrics (speaker recognition) under mismatche...
Performance of automatic speech recognition relies on a vast amount of training speech data mostly r...