The aim of this exploratory work is to predict, a priori, the quality of the automatic transcription in the case of speech mixed with music. In order to make this prediction, we need to quantify the impact of music (considered as noise in our study) on speech, before decoding by an Automatic Speech Recognition (ASR) system. Generally, the estimate of noise level in a speech signal exploits the bimodality of the noisy speech distribution. When the studied noise is music, the distribution has more than two modes, which makes noise level estimation unviable. We propose a new set of features. Entropy modulation (Pinquier et al., 2002, ICSLP) detects how much a signal is considered speech by measuring the lack of orde...
Speech in a noisy background presents a challenge for the recognition of that speech both by human l...
Age-related hearing loss (ARHL) is a very prevalent hearing disorder in adults...
The accurate extraction of two particular features from the speech signal affected by additive white...
Advances in the field of Automatic Speech Recognition (ASR) make it possible t...
Instrumental measures have been primarily used in order to predict speech quality or speec...
The acoustic environment in which speech is recorded has a strong influence on the statistical distr...
Many signal processing methods have been proposed to improve the quality of speech recorded in the p...
The performance of automatic speech recognition based on coded-decoded speech heavily depends on the...
This paper investigates a computational model that combines a frontend...
It is well established that accent recognition can reach up to 95% accuracy when the signals are ...
Several modification algorithms that alter natural or synthetic speech with the goal of improving in...
Automatic speech recognition in everyday environments must be robust to significant levels of reverb...
Evaluation of automatic speech recognition (ASR) systems is difficult and costly, since it requires ...
This paper examines the effect of applying noise compensation to acoustic speech feature prediction ...
The simulation framework for auditory discrimination experiments (FADE) was adopted and validated to...