Notable progress in music source separation has been achieved using multi-branch networks that operate on both temporal and spectral domains. However, such networks tend to be complex and heavy-weighted. In this work, we tackle the task of singing voice extraction from polyphonic music signals in an end-to-end manner using an approach inspired by the training procedure of denoising diffusion models. We perform unconditional signal modelling to gradually convert an input mixture signal to the corresponding singing voice or accompaniment. We use fewer parameters than the state-of-the-art models while operating on the waveform domain, bypassing phase-related problems. More concisely, we train a non-causal WaveNet using a diffusion-inspired str...
Comunicació i pòster presentats a l'Interspeech 2017 celebrat del 20 al 24 d'agost a Estocolm, Suèci...
[[abstract]]Monaural singing voice separation is an extremely challenging problem. While efforts in ...
State-of-the-art methods for monaural singing voice separation consist in estimating the magnitude s...
This work has been accepted at the 23rd International Society for Music Information Retrieval Confer...
A vocoder is a conditional audio generation model that converts acoustic features such as mel-spectr...
Comunicació presentada a: 2019 IEEE International Conference on Acoustics, Speech and Signal Process...
Comunicació presentada a: 2019 IEEE International Conference on Acoustics, Speech and Signal Process...
This paper presents two systems for extracting the vocals from a musical piece. Vocals extraction fi...
Singing voice synthesis (SVS) systems are built to synthesize high-quality and expressive singing vo...
Singing voice separation based on deep learning relies on the usage of time-frequency masking. In ma...
This thesis dissertation focuses on singing voice extraction from polyphonic musical signals. In par...
Identification and extraction of singing voice from within musical mixtures is a key challenge in so...
State-of-the-art singing voice separation is based on deep learning making use of CNN structures wit...
The objective of deep learning methods based on encoder-decoder architectures for music source separ...
Comunicació i pòster presentats a l'Interspeech 2017 celebrat del 20 al 24 d'agost a Estocolm, Suèci...
Comunicació i pòster presentats a l'Interspeech 2017 celebrat del 20 al 24 d'agost a Estocolm, Suèci...
[[abstract]]Monaural singing voice separation is an extremely challenging problem. While efforts in ...
State-of-the-art methods for monaural singing voice separation consist in estimating the magnitude s...
This work has been accepted at the 23rd International Society for Music Information Retrieval Confer...
A vocoder is a conditional audio generation model that converts acoustic features such as mel-spectr...
Comunicació presentada a: 2019 IEEE International Conference on Acoustics, Speech and Signal Process...
Comunicació presentada a: 2019 IEEE International Conference on Acoustics, Speech and Signal Process...
This paper presents two systems for extracting the vocals from a musical piece. Vocals extraction fi...
Singing voice synthesis (SVS) systems are built to synthesize high-quality and expressive singing vo...
Singing voice separation based on deep learning relies on the usage of time-frequency masking. In ma...
This thesis dissertation focuses on singing voice extraction from polyphonic musical signals. In par...
Identification and extraction of singing voice from within musical mixtures is a key challenge in so...
State-of-the-art singing voice separation is based on deep learning making use of CNN structures wit...
The objective of deep learning methods based on encoder-decoder architectures for music source separ...
Comunicació i pòster presentats a l'Interspeech 2017 celebrat del 20 al 24 d'agost a Estocolm, Suèci...
Comunicació i pòster presentats a l'Interspeech 2017 celebrat del 20 al 24 d'agost a Estocolm, Suèci...
[[abstract]]Monaural singing voice separation is an extremely challenging problem. While efforts in ...
State-of-the-art methods for monaural singing voice separation consist in estimating the magnitude s...