In this paper we present a new method to synthesize multiple languages with the voice of any arbitrary speaker. We call this method “HMM-based speaker-adaptable polyglot synthesis”. The idea consists in mixing data from several speakers in different languages to create a speaker-independent multilingual acoustic model. By means of MLLR, we can adapt this model to the voice of any given speaker. With the adapted model, it is possible to synthesize speech in any of the languages included in the training corpus with the voice of the target speaker, regardless of the language spoken by that speaker. When the language to be synthesized and the language of the target speaker are different, the performance of our method is better than that of othe...
Speech as a means of communication is most natural to human beings. Therefore, it should be straight...
In the EMIME project we have studied unsupervised cross-lingual speaker adaptation. We have employed...
A phone mapping-based method had been introduced forcross-lingual speaker adaptation in HMM-based sp...
Today, speech synthesizers in new languages are typically built by collecting several hours of well ...
The paper presents a novel architecture and method for speech synthesis in multiple languages, in vo...
This work explores multilingual speech synthesis. We compare three models based on Tacotron that uti...
Text-to-speech synthesis (TTS) turns a written text into an audio speech signal. Many commercial sys...
The task of text-to-speech (TTS) synthesis usually refers to a single language and to a single speak...
Current text-to-speech (TTS) systems are increasingly faced with mixed language tex-tual input. Most...
While the synthesis of natural sounding, neutral style speech can be achieved using today’s technolo...
An increasingly common scenario in building speech synthesis and recognition systems is training on ...
This paper describes a technique for synthesizing speech with any desired voice. The technique is ba...
We work to create a multilingual speech synthesis system which can generate speech with the proper a...
This paper deals with the creation of multiple voices from a Hidden Markov Model based speech synthe...
This paper presents a new approach to cross-lingual voice transformation in HMM-based TTS with only ...
Speech as a means of communication is most natural to human beings. Therefore, it should be straight...
In the EMIME project we have studied unsupervised cross-lingual speaker adaptation. We have employed...
A phone mapping-based method had been introduced forcross-lingual speaker adaptation in HMM-based sp...
Today, speech synthesizers in new languages are typically built by collecting several hours of well ...
The paper presents a novel architecture and method for speech synthesis in multiple languages, in vo...
This work explores multilingual speech synthesis. We compare three models based on Tacotron that uti...
Text-to-speech synthesis (TTS) turns a written text into an audio speech signal. Many commercial sys...
The task of text-to-speech (TTS) synthesis usually refers to a single language and to a single speak...
Current text-to-speech (TTS) systems are increasingly faced with mixed language tex-tual input. Most...
While the synthesis of natural sounding, neutral style speech can be achieved using today’s technolo...
An increasingly common scenario in building speech synthesis and recognition systems is training on ...
This paper describes a technique for synthesizing speech with any desired voice. The technique is ba...
We work to create a multilingual speech synthesis system which can generate speech with the proper a...
This paper deals with the creation of multiple voices from a Hidden Markov Model based speech synthe...
This paper presents a new approach to cross-lingual voice transformation in HMM-based TTS with only ...
Speech as a means of communication is most natural to human beings. Therefore, it should be straight...
In the EMIME project we have studied unsupervised cross-lingual speaker adaptation. We have employed...
A phone mapping-based method had been introduced forcross-lingual speaker adaptation in HMM-based sp...