Naturalness of synthetic speech highly depends on appropriate modeling of prosodic aspects. Mostly, three prosody components are modeled: segmental duration, pitch contour and intensity. In this study, we present our work on modeling segmental duration in Turkish using machine-learning algorithms, especially Classification and Regression Trees (CART). The models predict phone durations based on attributes such as phone identity, neighboring phone identities, lexical stress, position of syllable in word, part-of-speech (POS) information, word length in number of syllables and position of word in utterance extracted from a speech corpus of approximately 700 sentences. Obtained models predict segment durations better than mean duration approxi...
In this paper, we propose a neural network model for predicting the durations of syllables. A four l...
Durations of the Turkish phonemes are investigated in this study using the high quality digital reco...
This study describes the tree-based modeling of prosodic phrasing, pause duration between phrases an...
Naturalness of synthetic speech highly depends on appropriate modeling of prosodic aspects. Mostly, ...
Naturalness of synthetic speech highly depends on appropriate modelling of prosodic aspects. Mostly,...
Text-to-Speech (TTS) synthesis can be regarded as the automatic transformation of sentences from the...
Acoustic analysis and synthesis experiments have shown that duration and intonation patterns are the...
One of the essential prerequisites for achieving the naturalness of synthesized speech is the possib...
Classification and regression tree approach was used in this research to model phone duration of Lit...
The results of two alternative models to predict segmental durations in speech synthesis, both based...
The thesis describes new analysis and modelling of Korean segmental duration. It takes into account ...
The results of two alternative models to predict segmental durations in speech synthesis, both based...
Segmental duration was investigated in a database of Polish read speech (from one male speaker). The...
Abstract: We are going to show the application of neural networks in one of the critical modules of ...
The Phoneme Dedicated Artificial Neural Network (PDANN) segmental duration model consists of a set o...
In this paper, we propose a neural network model for predicting the durations of syllables. A four l...
Durations of the Turkish phonemes are investigated in this study using the high quality digital reco...
This study describes the tree-based modeling of prosodic phrasing, pause duration between phrases an...
Naturalness of synthetic speech highly depends on appropriate modeling of prosodic aspects. Mostly, ...
Naturalness of synthetic speech highly depends on appropriate modelling of prosodic aspects. Mostly,...
Text-to-Speech (TTS) synthesis can be regarded as the automatic transformation of sentences from the...
Acoustic analysis and synthesis experiments have shown that duration and intonation patterns are the...
One of the essential prerequisites for achieving the naturalness of synthesized speech is the possib...
Classification and regression tree approach was used in this research to model phone duration of Lit...
The results of two alternative models to predict segmental durations in speech synthesis, both based...
The thesis describes new analysis and modelling of Korean segmental duration. It takes into account ...
The results of two alternative models to predict segmental durations in speech synthesis, both based...
Segmental duration was investigated in a database of Polish read speech (from one male speaker). The...
Abstract: We are going to show the application of neural networks in one of the critical modules of ...
The Phoneme Dedicated Artificial Neural Network (PDANN) segmental duration model consists of a set o...
In this paper, we propose a neural network model for predicting the durations of syllables. A four l...
Durations of the Turkish phonemes are investigated in this study using the high quality digital reco...
This study describes the tree-based modeling of prosodic phrasing, pause duration between phrases an...