In this paper, we present a dictionary-based voice conversion (VC) approach that does not require parallel data or linguistic labeling for training process. Dictionary-based voice conversion is the class of methods aiming to decompose speech into separate factors for manipulation. Non-negative matrix factorization (NMF) is the most common method to decomposed input spectrum into a weighted linear combination of a set of bases (dictionary) and weights. However, the requirement for parallel training data in this method causes several problems: 1) limited practical usability when parallel data are not available, 2) additional error from alignment process degrades out-put speech quality. In order to alleviate these problems, this paper presents...
In this paper, we describe a novel speaker adaptation algorithm based on Gaussian mixture weight ada...
We propose voice conversion model from arbitrary source speaker to arbitrary target speaker with dis...
This paper presents a sparse representation framework for weighted frequency warping based voice con...
In this paper, we present a nonparallel voice conversion (VC) approach that does not require paralle...
This paper proposes a hierarchical latent embedding structure for Vector Quantized Variational Autoe...
We present in this paper a voice conver-sion (VC) method for a person with an ar-ticulation disorder...
International audienceMuch existing voice conversion (VC) systems are attractive owing to their high...
In this paper, we present a voice conversion (VC) method that does not use any parallel data while t...
Abstract In this paper, we present a voice conversion (VC) method that does not use any parallel dat...
Many research topics in speech processing face the same dif-ficult problem, how to create cheaply (o...
Abstract—A robust voice conversion function relies on a large amount of parallel training data, whic...
The objective of voice conversion techniques is to convert a source speaker's voice so that it sound...
Gburrek T, Ebbers J, Häb-Umbach R, Wagner P. Unsupervised Learning of a Disentangled Speech Represen...
Voice conversion (VC) transforms the speaking style of a source speaker to the speaking style of a t...
The objective of voice conversion methods is to modify the speech characteristics of a particular sp...
In this paper, we describe a novel speaker adaptation algorithm based on Gaussian mixture weight ada...
We propose voice conversion model from arbitrary source speaker to arbitrary target speaker with dis...
This paper presents a sparse representation framework for weighted frequency warping based voice con...
In this paper, we present a nonparallel voice conversion (VC) approach that does not require paralle...
This paper proposes a hierarchical latent embedding structure for Vector Quantized Variational Autoe...
We present in this paper a voice conver-sion (VC) method for a person with an ar-ticulation disorder...
International audienceMuch existing voice conversion (VC) systems are attractive owing to their high...
In this paper, we present a voice conversion (VC) method that does not use any parallel data while t...
Abstract In this paper, we present a voice conversion (VC) method that does not use any parallel dat...
Many research topics in speech processing face the same dif-ficult problem, how to create cheaply (o...
Abstract—A robust voice conversion function relies on a large amount of parallel training data, whic...
The objective of voice conversion techniques is to convert a source speaker's voice so that it sound...
Gburrek T, Ebbers J, Häb-Umbach R, Wagner P. Unsupervised Learning of a Disentangled Speech Represen...
Voice conversion (VC) transforms the speaking style of a source speaker to the speaking style of a t...
The objective of voice conversion methods is to modify the speech characteristics of a particular sp...
In this paper, we describe a novel speaker adaptation algorithm based on Gaussian mixture weight ada...
We propose voice conversion model from arbitrary source speaker to arbitrary target speaker with dis...
This paper presents a sparse representation framework for weighted frequency warping based voice con...