Abstract—Cross-lingual speaker adaptation (CLSA) has emerged as a new challenge in statistical parametric speech syn-thesis, with specific application to speech-to-speech translation. Recent research has shown that reasonable speaker similarity can be achieved in CLSA using maximum likelihood linear transformation of model parameters, but this method also has weaknesses due to the inherent mismatch caused by differing phonetic inventories of languages. In this paper, we propose that fast and effective CLSA can be made using vocal tract length normalization (VTLN), where strong constraints of the vocal tract warping function may actually help to avoid the most severe effects of the aforementioned mismatch. VTLN has a single parameter that wa...
A phone mapping-based method had been introduced for cross-lingual speaker adaptation in HMM-based s...
Vocal Tract Length Normalisation (VTLN) is a commonly used technique to normalise for inter-speaker...
This thesis deals with text-independent solutions for voice conversion. It first introduces the use ...
Vocal tract length normalization (VTLN) has been successfully used in automatic speech recognition f...
Recent research has demonstrated the effectiveness of vocal tract length normalization (VTLN) as a r...
Recent research has demonstrated the effectiveness of vocal tract length normalization (VTLN) as a r...
Recent research has demonstrated the effectiveness of vocal tract length normalization (VTLN) as a r...
Vocal tract length normalisation (VTLN) is a well known rapid adaptation technique. VTLN as a linear...
Vocal tract length normalization is an important feature normalization technique that can be used to...
This paper proposes an improved cross-lingualspeaker adaptation technique with considering the diffe...
The advent of statistical speech synthesis has enabled the unification of the basic techniques used ...
Generally speaking, the speaker-dependence of a speech recognition system stems from speaker-depende...
One of the main problems faced by automatic speech recognition is the variability of the testing con...
One of the main problems faced by automatic speech recognition is the variability of the testing con...
This paper provides an in-depth analysis of the impacts of language mismatch on the performance of c...
A phone mapping-based method had been introduced for cross-lingual speaker adaptation in HMM-based s...
Vocal Tract Length Normalisation (VTLN) is a commonly used technique to normalise for inter-speaker...
This thesis deals with text-independent solutions for voice conversion. It first introduces the use ...
Vocal tract length normalization (VTLN) has been successfully used in automatic speech recognition f...
Recent research has demonstrated the effectiveness of vocal tract length normalization (VTLN) as a r...
Recent research has demonstrated the effectiveness of vocal tract length normalization (VTLN) as a r...
Recent research has demonstrated the effectiveness of vocal tract length normalization (VTLN) as a r...
Vocal tract length normalisation (VTLN) is a well known rapid adaptation technique. VTLN as a linear...
Vocal tract length normalization is an important feature normalization technique that can be used to...
This paper proposes an improved cross-lingualspeaker adaptation technique with considering the diffe...
The advent of statistical speech synthesis has enabled the unification of the basic techniques used ...
Generally speaking, the speaker-dependence of a speech recognition system stems from speaker-depende...
One of the main problems faced by automatic speech recognition is the variability of the testing con...
One of the main problems faced by automatic speech recognition is the variability of the testing con...
This paper provides an in-depth analysis of the impacts of language mismatch on the performance of c...
A phone mapping-based method had been introduced for cross-lingual speaker adaptation in HMM-based s...
Vocal Tract Length Normalisation (VTLN) is a commonly used technique to normalise for inter-speaker...
This thesis deals with text-independent solutions for voice conversion. It first introduces the use ...