In this paper, we propose the use of speaker embedding networks to perform zero-shot singing voice conversion, and suggest two architectures for its realization. The use of speaker embedding networks not only enables the capability to adapt to new voices on-the-fly, but also allows for model training on unlabeled data. This not only facilitates the collection of suitable singing voice data, but also allows networks to be pretrained on large speech corpora before being refined on singing voice datasets, improving network generalization. We illustrate the effectiveness of the proposed zero-shot singing voice conversion algorithms by both qualitative and quantitative means
This thesis aims to implement a voice conversion system that transforms one persons voice into anoth...
Abstract Voice conversion is to transform a source speaker to the target one, while keeping the ling...
Text-to-speech (TTS) and singing voice synthesis (SVS) aim at generating high-quality speaking and s...
The task of Singing Voice Conversion(SVC) is to transform the voice of one singer(source) to someone...
Recent advances in deep learning not only facilitate the implementation of zero-shot singing voice s...
Speech recognition in singing is a task that has not been widely researched so far. Singing possesse...
In this paper, we use artificial neural networks (ANNs) for voice conversion and exploit the mapping...
The zero-shot scenario for speech generation aims at synthesizing a novel unseen voice with only one...
Many-to-many voice conversion with non-parallel training data has seen significant progress in recen...
We propose voice conversion model from arbitrary source speaker to arbitrary target speaker with dis...
Notable progress in music source separation has been achieved using multi-branch networks that opera...
Singing Voice Conversion is the task of converting the timbre of a source singer to another one with...
In this paper, we suggest a novel way to train GenerativeAdversarial Network (GAN) for the purpose o...
In this paper we propose a scheme for developing a voice conversion system that converts the speech ...
In this paper, we evaluate our proposed singing voice conver-sion method from various perspectives. ...
This thesis aims to implement a voice conversion system that transforms one persons voice into anoth...
Abstract Voice conversion is to transform a source speaker to the target one, while keeping the ling...
Text-to-speech (TTS) and singing voice synthesis (SVS) aim at generating high-quality speaking and s...
The task of Singing Voice Conversion(SVC) is to transform the voice of one singer(source) to someone...
Recent advances in deep learning not only facilitate the implementation of zero-shot singing voice s...
Speech recognition in singing is a task that has not been widely researched so far. Singing possesse...
In this paper, we use artificial neural networks (ANNs) for voice conversion and exploit the mapping...
The zero-shot scenario for speech generation aims at synthesizing a novel unseen voice with only one...
Many-to-many voice conversion with non-parallel training data has seen significant progress in recen...
We propose voice conversion model from arbitrary source speaker to arbitrary target speaker with dis...
Notable progress in music source separation has been achieved using multi-branch networks that opera...
Singing Voice Conversion is the task of converting the timbre of a source singer to another one with...
In this paper, we suggest a novel way to train GenerativeAdversarial Network (GAN) for the purpose o...
In this paper we propose a scheme for developing a voice conversion system that converts the speech ...
In this paper, we evaluate our proposed singing voice conver-sion method from various perspectives. ...
This thesis aims to implement a voice conversion system that transforms one persons voice into anoth...
Abstract Voice conversion is to transform a source speaker to the target one, while keeping the ling...
Text-to-speech (TTS) and singing voice synthesis (SVS) aim at generating high-quality speaking and s...