A well-known unfavorable property of HMMs in speech recognition is their inappropriate representation of phone and word durations. This paper describes an approach to resolve this limitation by integrating explicit word duration models into an HMM-based speech recognizer. Word durations are represented by log-normal densities using a back-off strategy that approximates durations of words that have been observed seldom by a combination of the statistics of suitable sub-word units. Furthermore, two different normalization procedures are compared which reduce the influence of the implicit HMM duration distribution resulting from the state-to-state transition probabilities. Experiments on European parliamentary speeches in English and Spanish l...
This paper describes a new approach to modeling duration for LVCSR using SCARF, a toolkit for speech...
While hidden Markov model (HMM) are the most widely used model in automatic speech recognition, its ...
Thesis (M.S.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
The duration of speech units is an important cue in speech recognition. But most of the current spee...
The occupancy of the HMM states is modeled by means of a Markov chain. A linear estimator is introdu...
When phone segmentations are known a priori, normalizing the duration of each phone has been shown t...
In this paper a method of integrating a model of suprasegmental duration with a HMM-based recogniser...
State transition matrices as used in standard HMM decoders have two widely perceived limitations. On...
A new duration intrinsic model for improved speech recognition by HMM techniques is presented. Assum...
Durations of real speech segments do not generally exhibit exponential distributions, as modelled im...
We present a method for speaker recognition that uses the duration patterns of speech units to aid s...
State transition matrices as used in standard HMM decoders have two widely perceived limitations. On...
Hidden Markov models (HMMs) are currently the most successful paradigm for speech recognition. Altho...
In this paper a method of integrating a model of suprasegmental duration with a HMM-based recogniser...
In this paper we investigate external phone duration models (PDMs) for improving the quality of synt...
This paper describes a new approach to modeling duration for LVCSR using SCARF, a toolkit for speech...
While hidden Markov model (HMM) are the most widely used model in automatic speech recognition, its ...
Thesis (M.S.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
The duration of speech units is an important cue in speech recognition. But most of the current spee...
The occupancy of the HMM states is modeled by means of a Markov chain. A linear estimator is introdu...
When phone segmentations are known a priori, normalizing the duration of each phone has been shown t...
In this paper a method of integrating a model of suprasegmental duration with a HMM-based recogniser...
State transition matrices as used in standard HMM decoders have two widely perceived limitations. On...
A new duration intrinsic model for improved speech recognition by HMM techniques is presented. Assum...
Durations of real speech segments do not generally exhibit exponential distributions, as modelled im...
We present a method for speaker recognition that uses the duration patterns of speech units to aid s...
State transition matrices as used in standard HMM decoders have two widely perceived limitations. On...
Hidden Markov models (HMMs) are currently the most successful paradigm for speech recognition. Altho...
In this paper a method of integrating a model of suprasegmental duration with a HMM-based recogniser...
In this paper we investigate external phone duration models (PDMs) for improving the quality of synt...
This paper describes a new approach to modeling duration for LVCSR using SCARF, a toolkit for speech...
While hidden Markov model (HMM) are the most widely used model in automatic speech recognition, its ...
Thesis (M.S.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...