We introduce a method of incorporating additional knowledge sources into an HMM-based statistical acoustic model. The probabilistic relationship between information sources is first learned through a Bayesian network to easily integrate any ad-ditional knowledge sources that might come from any domain and then the global joint probability density function (PDF) of the model is formulated. Where the model becomes too com-plex and direct BN inference is intractable, we utilize a junction tree algorithm to decompose the global joint PDF into a linked set of local conditional PDFs. This way, a simplified form of the model can be constructed and reliably estimated using a limited amount of training data. Here, we apply this framework to in-corpo...
Abstract During the last decade, the most significant advances in the field of continuous speech rec...
During the last decade the field of speech recognition has used the theory of hidden Markov models (...
International audienceHeterogeneous knowledge sources that model speech only at certain time frames ...
Abstract We propose a new method of incorporating the additional knowledge of accent, gender, and wi...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
This article provides a unifying Bayesian view on various approaches for acoustic model adaptation, ...
In this paper, we explore how different acoustic modeling tech-niques can benefit from data in langu...
This thesis investigates a stochastic modeling approach to word hypothesis of phonetic strings for a...
This paper presents the work done to improve the recognition rate lit an isolated word recognition p...
This paper describes the application of Bayesian networks to automatic speech recognition. Bayesian ...
Abstract Most of the current state-of-the-art speech recognition systems are based on speech signal ...
The entire dissertation/thesis text is included in the research.pdf file; the official abstract appe...
AbstractThis paper introduces an autoregressive hidden Markov model (HMM) and demonstrates its appli...
Abstract. This paper describes several ways of acoustic keywords spot-ting (KWS), based on Gaussian ...
For over a decade, the Hidden Markov Model (HMM) has been the primary tool used for acoustic modelin...
Abstract During the last decade, the most significant advances in the field of continuous speech rec...
During the last decade the field of speech recognition has used the theory of hidden Markov models (...
International audienceHeterogeneous knowledge sources that model speech only at certain time frames ...
Abstract We propose a new method of incorporating the additional knowledge of accent, gender, and wi...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
This article provides a unifying Bayesian view on various approaches for acoustic model adaptation, ...
In this paper, we explore how different acoustic modeling tech-niques can benefit from data in langu...
This thesis investigates a stochastic modeling approach to word hypothesis of phonetic strings for a...
This paper presents the work done to improve the recognition rate lit an isolated word recognition p...
This paper describes the application of Bayesian networks to automatic speech recognition. Bayesian ...
Abstract Most of the current state-of-the-art speech recognition systems are based on speech signal ...
The entire dissertation/thesis text is included in the research.pdf file; the official abstract appe...
AbstractThis paper introduces an autoregressive hidden Markov model (HMM) and demonstrates its appli...
Abstract. This paper describes several ways of acoustic keywords spot-ting (KWS), based on Gaussian ...
For over a decade, the Hidden Markov Model (HMM) has been the primary tool used for acoustic modelin...
Abstract During the last decade, the most significant advances in the field of continuous speech rec...
During the last decade the field of speech recognition has used the theory of hidden Markov models (...
International audienceHeterogeneous knowledge sources that model speech only at certain time frames ...