Abstra t. Pit h and energy are two fundamental features de-s ribing spee h, having importan e in human spee h re ognition. However, when in orporated as features in automati spee h re og-nition (ASR), they usually result in a signi ant degradation on re ognition performan e due to the noise inherent in estimating or modeling them. In this paper, we show experimentally how this an be orre ted by either onditioning the emission distributions upon these features or by marginalizing out these features in re og-nition. Sin e this is not obvious to do with standard hidden Markov models (HMMs), this work has been performed in the framework of dynami Bayesian networks (DBNs), resulting in more exibil-ity in dening the topology of the emission ...
Abstract—This letter investigates the problem of incorporating auxiliary information, e.g., pitch, z...
The use of visual features in audio-visual speech recognition (AVSR) is justified by both the speec...
We describe a dynamic Bayesian network for articulatory feature recognition. The model is intended t...
In standard automatic speech recognition (ASR), hidden Markov models (HMMs) calculate their emission...
Automatic speech recognition bases its models on the acoustic features derived from the speech signa...
Automatic speech recognition (ASR) is a very challenging problem due to the wide variety of the data...
Current technology for automatic speech recognition (ASR) uses hidden Markov models (HMMs) that reco...
This article provides a unifying Bayesian view on various approaches for acoustic model adaptation, ...
Colloque avec actes et comité de lecture. nationale.National audienceThis paper presents a novel app...
This paper describes the application of Bayesian networks to automatic speech recognition. Bayesian ...
We present a speech modeling methodology where no a priori assumption is made on the dependencies be...
This paper describes the theory and implementation of Bayesian networks in the context of automatic ...
Abstract Most of the current state-of-the-art speech recognition systems are based on speech signal ...
This paper describes the use of dynamic Bayesian networks for the task of articulatory feature recog...
Colloque avec actes et comité de lecture. internationale.International audienceState-of-the-art auto...
Abstract—This letter investigates the problem of incorporating auxiliary information, e.g., pitch, z...
The use of visual features in audio-visual speech recognition (AVSR) is justified by both the speec...
We describe a dynamic Bayesian network for articulatory feature recognition. The model is intended t...
In standard automatic speech recognition (ASR), hidden Markov models (HMMs) calculate their emission...
Automatic speech recognition bases its models on the acoustic features derived from the speech signa...
Automatic speech recognition (ASR) is a very challenging problem due to the wide variety of the data...
Current technology for automatic speech recognition (ASR) uses hidden Markov models (HMMs) that reco...
This article provides a unifying Bayesian view on various approaches for acoustic model adaptation, ...
Colloque avec actes et comité de lecture. nationale.National audienceThis paper presents a novel app...
This paper describes the application of Bayesian networks to automatic speech recognition. Bayesian ...
We present a speech modeling methodology where no a priori assumption is made on the dependencies be...
This paper describes the theory and implementation of Bayesian networks in the context of automatic ...
Abstract Most of the current state-of-the-art speech recognition systems are based on speech signal ...
This paper describes the use of dynamic Bayesian networks for the task of articulatory feature recog...
Colloque avec actes et comité de lecture. internationale.International audienceState-of-the-art auto...
Abstract—This letter investigates the problem of incorporating auxiliary information, e.g., pitch, z...
The use of visual features in audio-visual speech recognition (AVSR) is justified by both the speec...
We describe a dynamic Bayesian network for articulatory feature recognition. The model is intended t...