Machine learning based singing voice models require large datasets and lengthy training times. In this work we present a lightweight architecture, based on the Differentiable Digital Signal Processing (DDSP) library, that is able to output song-like utterances conditioned only on pitch and amplitude, after twelve hours of training using small datasets of unprocessed audio. The results are promising, as both the melody and the singer’s voice are recognizable. In addition, we explore the unused latent- vector in DDSP to improve the lyrics. Furthermore, we present two zeroconfiguration tools to train new models, including our experimental models. Our results indicate that the latent- improves both the identification of the singer as well as th...
Singing voice synthesis (SVS) systems are built to synthesize high-quality and expressive singing vo...
In this paper, we propose a novel area of research referred to as singing information processing. To...
Oftentimes when we listen to a familiar singer, the unique qual-ities of that performer’s voice allo...
A vocoder is a conditional audio generation model that converts acoustic features such as mel-spectr...
Speech recognition in singing is a task that has not been widely researched so far. Singing possesse...
Recent advances in deep learning not only facilitate the implementation of zero-shot singing voice s...
We recently presented a new model for singing synthesis based on a modified version of the WaveNet a...
In singing voice synthesis process, score and lyrics for a target song are converted to singing voic...
We recently presented a new model for singing synthesis based on a modified version of the WaveNet a...
The task of Singing Voice Conversion(SVC) is to transform the voice of one singer(source) to someone...
This paper introduces a new open-source platform named Muskits for end-to-end music processing, whic...
Individual singing voices tend to be easy for a listener to identify, particularly when compared to ...
Comunicació presentada al EUSIPCO 2019: 27th European Signal Processing Conference, celebrat del 2 a...
Comunicació i pòster presentats a l'Interspeech 2017 celebrat del 20 al 24 d'agost a Estocolm, Suèci...
Notable progress in music source separation has been achieved using multi-branch networks that opera...
Singing voice synthesis (SVS) systems are built to synthesize high-quality and expressive singing vo...
In this paper, we propose a novel area of research referred to as singing information processing. To...
Oftentimes when we listen to a familiar singer, the unique qual-ities of that performer’s voice allo...
A vocoder is a conditional audio generation model that converts acoustic features such as mel-spectr...
Speech recognition in singing is a task that has not been widely researched so far. Singing possesse...
Recent advances in deep learning not only facilitate the implementation of zero-shot singing voice s...
We recently presented a new model for singing synthesis based on a modified version of the WaveNet a...
In singing voice synthesis process, score and lyrics for a target song are converted to singing voic...
We recently presented a new model for singing synthesis based on a modified version of the WaveNet a...
The task of Singing Voice Conversion(SVC) is to transform the voice of one singer(source) to someone...
This paper introduces a new open-source platform named Muskits for end-to-end music processing, whic...
Individual singing voices tend to be easy for a listener to identify, particularly when compared to ...
Comunicació presentada al EUSIPCO 2019: 27th European Signal Processing Conference, celebrat del 2 a...
Comunicació i pòster presentats a l'Interspeech 2017 celebrat del 20 al 24 d'agost a Estocolm, Suèci...
Notable progress in music source separation has been achieved using multi-branch networks that opera...
Singing voice synthesis (SVS) systems are built to synthesize high-quality and expressive singing vo...
In this paper, we propose a novel area of research referred to as singing information processing. To...
Oftentimes when we listen to a familiar singer, the unique qual-ities of that performer’s voice allo...