We present a new approach to solve the problem of phone segmentation when preparing databases for concatenative Text-to-Speech synthesis. First, we describe the problem and review the state of the art. Then we present some already existing techniques to perform this segmentation and present our approach based on a Regression Tree to perform Boundary Specific Correction of the HMM segmentation. We discus different evaluation procedures. Finally, we compare some systems and we show how our system improves the system based on HMMs setting 94 % of the boundaries within a tolerance of 20ms compared to a manual segmentation, and how phonetic rather than acoustical features are better suited for this task. 1
Automatic phone segmentation techniques based on model selection criteria are studied. We investigat...
Thesis (M.Ing. (Computer Engineering))--North-West University, Potchefstroom Campus, 2009.The rapid ...
Consistent phoneme segmentation is essential in building high quality Text-to-Speech (TTS) voice fon...
This paper deals with the problems of automatic segmentation for the purposes of Czech concatenative...
www.talp.upc.es In the present paper we present two novel approaches to phonetic speech segmentation...
International audienceThis paper introduces a new approach for the automatic segmentation of corpora...
This paper studies the performance of automatic phone segmentation from two viewpoints: (1) temporal...
International audienceThis paper introduces a new approach for the automatic segmentation of corpora...
For segmenting a speech database, using a family of acoustic models provides multiple estimates of e...
Phonetic segmentation is the breakup and classication of the sound signal into a string of phones. T...
: Concatenative Text-To-Speech synthesizers join pre-recorded segments of speech data in order to pr...
The paper describes a method for automatically segmenting a database of isolated words as required f...
This paper describes the refinement of the automatic speech segmentation into phones obtained via Hi...
Despite using different algorithms, most unsupervised automatic phone segmentation methods achieve s...
In this paper, after an a review of the previous work done in this field, the most frequently used a...
Automatic phone segmentation techniques based on model selection criteria are studied. We investigat...
Thesis (M.Ing. (Computer Engineering))--North-West University, Potchefstroom Campus, 2009.The rapid ...
Consistent phoneme segmentation is essential in building high quality Text-to-Speech (TTS) voice fon...
This paper deals with the problems of automatic segmentation for the purposes of Czech concatenative...
www.talp.upc.es In the present paper we present two novel approaches to phonetic speech segmentation...
International audienceThis paper introduces a new approach for the automatic segmentation of corpora...
This paper studies the performance of automatic phone segmentation from two viewpoints: (1) temporal...
International audienceThis paper introduces a new approach for the automatic segmentation of corpora...
For segmenting a speech database, using a family of acoustic models provides multiple estimates of e...
Phonetic segmentation is the breakup and classication of the sound signal into a string of phones. T...
: Concatenative Text-To-Speech synthesizers join pre-recorded segments of speech data in order to pr...
The paper describes a method for automatically segmenting a database of isolated words as required f...
This paper describes the refinement of the automatic speech segmentation into phones obtained via Hi...
Despite using different algorithms, most unsupervised automatic phone segmentation methods achieve s...
In this paper, after an a review of the previous work done in this field, the most frequently used a...
Automatic phone segmentation techniques based on model selection criteria are studied. We investigat...
Thesis (M.Ing. (Computer Engineering))--North-West University, Potchefstroom Campus, 2009.The rapid ...
Consistent phoneme segmentation is essential in building high quality Text-to-Speech (TTS) voice fon...