Recently we have developed a novel type of structure-based speech recognizer, which uses a parameterized, non-recursive “hidden” trajectory model of vocal tract resonances (VTR) or formants to capture the dynamic structure of long-range speech coarticulation and reduction. The underlying model of this recognizer carries out bi-directional FIR filtering on the piecewise constant sequences of the VTR targets. In this paper, we elaborate on two key aspects of the model. First, the phonetic context controls the movement direction and thus the formation of the VTR trajectories. This provides “structured” context dependency for speech acoustics without using context-dependent parameters as required by HMMs. Second, VTR targets as the key context-...
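The bi-directional FIR filtering mentioned in the abstract above can be illustrated with a minimal sketch. The abstract does not give the filter's form, so everything specific here is an assumption made for illustration: the symmetric, exponentially decaying kernel, the `gamma` and `half_width` parameters, the edge handling, and the target values in Hz are all hypothetical and are not taken from the paper.

```python
def bidirectional_fir(targets, gamma=0.6, half_width=7):
    """Smooth a piecewise-constant target sequence with a symmetric FIR kernel.

    Assumed kernel (not from the source): weight gamma**|d| at offset d,
    normalized to sum to 1, looking half_width frames in both directions.
    """
    offsets = range(-half_width, half_width + 1)
    taps = [gamma ** abs(d) for d in offsets]
    norm = sum(taps)
    last = len(targets) - 1
    trajectory = []
    for k in range(len(targets)):
        acc = 0.0
        for d, w in zip(offsets, taps):
            # Assumed edge handling: repeat the first/last target past the ends.
            j = min(max(k + d, 0), last)
            acc += w * targets[j]
        trajectory.append(acc / norm)
    return trajectory


# Hypothetical first-formant targets (Hz) with per-segment durations in frames.
segments = [(500.0, 10), (300.0, 4), (700.0, 10)]
targets = [f for f, n in segments for _ in range(n)]
trajectory = bidirectional_fir(targets)
```

Because each output frame is a weighted average of targets on both sides, the neighbouring segments decide which way the trajectory moves. The short middle segment never reaches its own 300 Hz target, which is one simple way to picture context-driven movement and reduction without context-dependent parameters.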
This paper presents a systematic framework for accurate estimation of vocal tract resonances (forman...
ICASSP2006: IEEE International Conference on Acoustics, Speech, and Signal Processing, May 14-19, ...
This paper introduces a new approach to acoustic-phonetic modelling, the Hidden Dynamic Model (HDM),...
Abstract—Modeling dynamic structure of speech is a novel paradigm in speech recognition research wit...
Abstract—A structured generative model of speech coarticulation and reduction is described with a n...
We report in this paper our recent progress on the new development, implementation, and evaluation ...
We present in this paper an overview of the Hidden Dynamic Model (HDM) paradigm, exemplifying parame...
It has been shown in several recent publications that application of vocal tract normalization (VTN)...
We propose and evaluate a new acoustic model that combines HMM and a special type of the hidden dyna...
In this paper, a novel speaker normalization method is presented and compared to a well-known vocal ...
One of the biggest difficulties in automatic speech recognition (ASR) is how to deal with variations...
Abstract. Most speech models represent the static and derivative cepstral features with separate mo...
The conditional independence assumption imposed by the hidden Markov models (HMMs) makes it difficul...
Abstract—A novel Kalman filtering/smoothing algorithm is presented for efficient and accurate estima...
We describe a speech recogniser which uses a speech production-motivated phonetic-feature descriptio...