Many existing voice conversion (VC) systems are attractive owing to their high performance in terms of voice quality and speaker similarity. Nevertheless, without parallel training data, some generated waveform trajectories are not yet smooth, leading to degraded sound quality and mispronunciation issues in the converted speech. To address these shortcomings, this paper proposes a non-parallel VC system based on an auto-regressive model, Phonetic PosteriorGrams (PPGs), and an LPCNet vocoder to generate high-quality converted speech. The proposed auto-regressive structure enables our system to produce next-step outputs from the previous step's acoustic features. Further, the use of PPGs aims to convert any unknown ...
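The auto-regressive idea mentioned in the abstract can be illustrated with a minimal sketch: each output acoustic frame is computed from the current PPG frame and the previously generated frame. Everything named here (`autoregressive_decode`, `toy_step`, the frame sizes) is a hypothetical illustration, not the paper's model; `toy_step` is a simple stand-in for a trained network.

```python
from typing import Callable, List, Sequence

Frame = List[float]


def autoregressive_decode(
    ppg_frames: Sequence[Frame],
    step_fn: Callable[[Frame, Frame], Frame],
    initial_frame: Frame,
) -> List[Frame]:
    """Generate one acoustic frame per PPG frame, feeding each output back in."""
    outputs: List[Frame] = []
    prev = initial_frame
    for ppg in ppg_frames:
        # The next frame depends on the current PPG frame and the previous output.
        prev = step_fn(ppg, prev)
        outputs.append(prev)
    return outputs


def toy_step(ppg: Frame, prev: Frame) -> Frame:
    """Hypothetical stand-in for a trained network (assumption, not the paper's model).

    Blends the previous acoustic frame with the mean of the current PPG frame.
    """
    ppg_mean = sum(ppg) / len(ppg)
    return [0.5 * p + 0.5 * ppg_mean for p in prev]


# Two toy PPG frames, decoded starting from an all-zero acoustic frame.
frames = autoregressive_decode([[1.0, 0.0], [0.0, 0.0]], toy_step, [0.0, 0.0])
```

Because each step consumes the previous output, the second frame still carries information from the first PPG frame even though the second PPG frame is all zeros. This dependence on earlier outputs is the property the abstract associates with smoother trajectories.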
This paper proposes a hierarchical latent embedding structure for Vector Quantized Variational Autoe...
Voice conversion (VC) is a technique to transform a speaker identity included in a source speech wav...
This thesis deals with text-independent solutions for voice conversion. It first introduces the use ...
We present an any-to-one voice conversion (VC) system, using an autoregressive model and LPCNet voco...
We propose a joint training scheme of an any-to-one voice conversion (VC) system with LPCNet to impr...
In this project a Phonetic Posteriorgram (PPG) based Voice Conversion system is implemented. The mai...
We propose a voice conversion model from an arbitrary source speaker to an arbitrary target speaker with dis...
Voice conversion (VC) transforms the speaking style of a source speaker to the speaking style of a t...
The objective of voice conversion techniques is to convert a source speaker's voice so that it sound...
In this paper, we present a dictionary-based voice conversion (VC) approach that does not require pa...
Voice conversion (VC) is a technique to transform a speaker identity included in a source speech wav...
Recently, a lot of work has been done in speech technology. Text-to-Speech and Automatic Speech Rec...
In this paper, we present a nonparallel voice conversion (VC) approach that does not require paralle...
Voice conversion (VC) technology makes it possible to transform the voice of the source speaker so that it is p...