The objective of voice conversion techniques is to convert a source speaker's voice so that it sounds like that of a target speaker. Voice conversion belongs to a popular area of personalized speech generation. On one hand, it can be applied to solve problems, e.g. emotion conversion, improving the intelligibility of speech, or change whisper/murmur into speech. On the other hand, voice conversion also presents a threat to Automatic Speaker Verification (ASV) systems. Synthetic speech detection is a technique to discriminate between live and synthetic speech. In this sense, it provides a feasible way to improve robustness and protect ASV systems. This thesis focuses on the following two aspects: improving performance in voice conversion and...
Synthetic speech is speech signals generated by text-to-speech (TTS) and voice conversion (VC) techn...
This paper presents a sparse representation framework for weighted frequency warping based voice con...
Voice conversion (VC) transforms the speaking style of a source speaker to the speaking style of a t...
Voice conversion is the process to modify a speech signal of one speaker (source) to sound like an i...
The voice conversion’s task is to modify a source speaker’s voice to sound like that of a target spe...
Voice conversion is a speech technology encompassing transformations applied to the speech signal wi...
International audienceIn Voice Conversion (VC), the speech of a source speaker is modified to resemb...
Speaker identity plays an important role in human communication. In addition to the linguistic conte...
Voice conversion (VC) is a technique to transform a speaker identity included in a source speech wav...
The main challenge introduced in current voice conversion is the tradeoff between speaker similarity...
The performance of biometric systems based on automatic speaker recognition technology is severely d...
Voice conversion (VC) is a technique to transform a speaker identity included in a source speech wav...
Voice conversion (VC) is a technique to transform a speaker identity included in a source speech wav...
Kuhlmann M, Seebauer FM, Ebbers J, Wagner P, Haeb-Umbach R. Investigation into Target Speaking Rate ...
International audienceMuch existing voice conversion (VC) systems are attractive owing to their high...
Synthetic speech is speech signals generated by text-to-speech (TTS) and voice conversion (VC) techn...
This paper presents a sparse representation framework for weighted frequency warping based voice con...
Voice conversion (VC) transforms the speaking style of a source speaker to the speaking style of a t...
Voice conversion is the process to modify a speech signal of one speaker (source) to sound like an i...
The voice conversion’s task is to modify a source speaker’s voice to sound like that of a target spe...
Voice conversion is a speech technology encompassing transformations applied to the speech signal wi...
International audienceIn Voice Conversion (VC), the speech of a source speaker is modified to resemb...
Speaker identity plays an important role in human communication. In addition to the linguistic conte...
Voice conversion (VC) is a technique to transform a speaker identity included in a source speech wav...
The main challenge introduced in current voice conversion is the tradeoff between speaker similarity...
The performance of biometric systems based on automatic speaker recognition technology is severely d...
Voice conversion (VC) is a technique to transform a speaker identity included in a source speech wav...
Voice conversion (VC) is a technique to transform a speaker identity included in a source speech wav...
Kuhlmann M, Seebauer FM, Ebbers J, Wagner P, Haeb-Umbach R. Investigation into Target Speaking Rate ...
International audienceMuch existing voice conversion (VC) systems are attractive owing to their high...
Synthetic speech is speech signals generated by text-to-speech (TTS) and voice conversion (VC) techn...
This paper presents a sparse representation framework for weighted frequency warping based voice con...
Voice conversion (VC) transforms the speaking style of a source speaker to the speaking style of a t...