Concatenative speech synthesis depends on accurate segmentation of the phonemes in a training corpus. Segmentation is typically performed using HMMs, but recent speech recognition work suggests that the transient acoustic features characteristic of manner-class phoneme boundaries (landmarks) may be more precisely localized using acoustic classifiers specifically designed for the task of landmark detection. This paper empirically explores several features and techniques that can improve the performance of existing HMM tools such as HTK. On a standard benchmark data set, we achieve new state-of-the-art performance, reducing error in marking the landmarks from an average deviation of 28 milliseconds to 15 millis...
Techniques for automatic phoneme recognition from spoken speech are investigated. The goal is to ext...
The standard hidden Markov model (HMM) has proven to be the most successful model for speech re...
Speech is composed of basic speech sounds called phonemes, and these subword units are the foundatio...
The training of precise speech recognition models depends on accurate segmentation of the phonemes i...
In this work, we present a new approach for the classification and detection o...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
A probabilistic and statistical framework is presented for automatic speech recognition based on a p...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer ...
Acoustic landmarks (abrupt changes associated with consonant closures and releases, vowels and glide...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
Artificial neural networks (ANNs) have been used to classify phonetic features in speech. The featur...
Statistical data-driven methods and knowledge-based methods are two recent trends in Automatic Speec...
The conditional independence assumption imposed by the hidden Markov models (HMMs) makes it difficul...
In this paper we present two novel approaches to phonetic speech segmentation...
In this study we propose two methods to improve HMM speech recognition performance. The first method...