International audienceUncertainty decoding has been successfully used for speech recognition in highly nonstationary noise environments. Yet, accurate estimation of the uncertainty on the denoised signals and propagation to the features remain difficult. In this work, we propose to fuse the uncertainty estimates obtained from different uncertainty estimators and propagators by linear combination. The fusion coefficients are optimized by minimizing a measure of divergence with oracle estimates on development data. Using the Kullback-Leibler divergence, we obtain 18\% relative error rate reduction on the 2nd CHiME Challenge with respect to conventional decoding, that is about twice as much as the reduction achieved by the best single uncertai...
International audienceWe present a joint spatial and spectral denoising front-end for Track 1 of the...
Yilmaz E., Gemmeke J.F., Van hamme H., ''Noise-robust speech recognition with exemplar-based sparse ...
The performance of automatic speaker recognition systems degrades when facing distorted speech data ...
International audienceWe consider the framework of uncertainty propagation for automatic speech reco...
Cette thèse se focalise sur la reconnaissance automatique de la parole (RAP) robuste au bruit. Elle ...
International audienceAutomatic speech recognition (ASR) in noisy environments remains a challenging...
This thesis focuses on noise robust automatic speech recognition (ASR). It includes twoparts. First,...
International audienceWe consider the problem of robust automatic speech recognition (ASR) in noisy ...
International audienceThe uncertainty decoding framework is known to improve deep neural network (DN...
International audienceWe consider the problem of uncertainty estimation for noise-robust ASR. Existi...
International audienceIn order to improve the ASR performance in noisy environments , distorted spee...
The term uncertainty decoding has been phrased for a class of robustness enhancing algorithms in aut...
The use of feature enhancement techniques to obtain estimates of the clean parameters is a common ap...
In this paper we develop different mathematical models in the framework of the multi-stream paradigm...
In this thesis, a joint optimal method for clean speech estimation and ASR in a mismatched condition...
International audienceWe present a joint spatial and spectral denoising front-end for Track 1 of the...
Yilmaz E., Gemmeke J.F., Van hamme H., ''Noise-robust speech recognition with exemplar-based sparse ...
The performance of automatic speaker recognition systems degrades when facing distorted speech data ...
International audienceWe consider the framework of uncertainty propagation for automatic speech reco...
Cette thèse se focalise sur la reconnaissance automatique de la parole (RAP) robuste au bruit. Elle ...
International audienceAutomatic speech recognition (ASR) in noisy environments remains a challenging...
This thesis focuses on noise robust automatic speech recognition (ASR). It includes twoparts. First,...
International audienceWe consider the problem of robust automatic speech recognition (ASR) in noisy ...
International audienceThe uncertainty decoding framework is known to improve deep neural network (DN...
International audienceWe consider the problem of uncertainty estimation for noise-robust ASR. Existi...
International audienceIn order to improve the ASR performance in noisy environments , distorted spee...
The term uncertainty decoding has been phrased for a class of robustness enhancing algorithms in aut...
The use of feature enhancement techniques to obtain estimates of the clean parameters is a common ap...
In this paper we develop different mathematical models in the framework of the multi-stream paradigm...
In this thesis, a joint optimal method for clean speech estimation and ASR in a mismatched condition...
International audienceWe present a joint spatial and spectral denoising front-end for Track 1 of the...
Yilmaz E., Gemmeke J.F., Van hamme H., ''Noise-robust speech recognition with exemplar-based sparse ...
The performance of automatic speaker recognition systems degrades when facing distorted speech data ...