Singing Voice Separation (SVS) tries to separate singing voice from a given mixed musical signal. Recently, many U-Net-based models have been proposed for the SVS task, but there were no existing works that evaluate and compare various types of intermediate blocks that can be used in the U-Net architecture. In this paper, we introduce a variety of intermediate spectrogram transformation blocks. We implement U-nets based on these blocks and train them on complex-valued spectrograms to consider both magnitude and phase. These networks are then compared on the SDR metric. When using a particular block composed of convolutional and fully-connected layers, it achieves state-of-the-art SDR on the MUSDB singing voice separation task by a large mar...
Deep neural networks with convolutional layers usually process the entire spectrogram of an audio si...
Comunicació presentada a: International Society for Music Information Retrieval Conference celebrat ...
This paper presents two systems for extracting the vocals from a musical piece. Vocals extraction fi...
State-of-the-art singing voice separation is based on deep learning making use of CNN structures wit...
Models for audio source separation usually operate on the magnitude spectrum, which ignores phase in...
Monaural singing voice separation has received much attention in recent years. In this paper, we pro...
The decomposition of a music audio signal into its vocal and backing track components is analogous t...
Informed source separation has recently gained renewed interest with the introduction of neural netw...
Notable progress in music source separation has been achieved using multi-branch networks that opera...
A new single channel singing voice separation algorithm is presented in this paper. This field of si...
[[abstract]]Monaural singing voice separation is an extremely challenging problem. While efforts in ...
Choral singing is a widely practiced form of ensemble singing wherein a group of people sing simulta...
State-of-the-art methods for monaural singing voice separation consist in estimating the magnitude s...
Choral singing is a widely practiced form of ensemble singing wherein a group of people sing simulta...
This work has been accepted at the 23rd International Society for Music Information Retrieval Confer...
Deep neural networks with convolutional layers usually process the entire spectrogram of an audio si...
Comunicació presentada a: International Society for Music Information Retrieval Conference celebrat ...
This paper presents two systems for extracting the vocals from a musical piece. Vocals extraction fi...
State-of-the-art singing voice separation is based on deep learning making use of CNN structures wit...
Models for audio source separation usually operate on the magnitude spectrum, which ignores phase in...
Monaural singing voice separation has received much attention in recent years. In this paper, we pro...
The decomposition of a music audio signal into its vocal and backing track components is analogous t...
Informed source separation has recently gained renewed interest with the introduction of neural netw...
Notable progress in music source separation has been achieved using multi-branch networks that opera...
A new single channel singing voice separation algorithm is presented in this paper. This field of si...
[[abstract]]Monaural singing voice separation is an extremely challenging problem. While efforts in ...
Choral singing is a widely practiced form of ensemble singing wherein a group of people sing simulta...
State-of-the-art methods for monaural singing voice separation consist in estimating the magnitude s...
Choral singing is a widely practiced form of ensemble singing wherein a group of people sing simulta...
This work has been accepted at the 23rd International Society for Music Information Retrieval Confer...
Deep neural networks with convolutional layers usually process the entire spectrogram of an audio si...
Comunicació presentada a: International Society for Music Information Retrieval Conference celebrat ...
This paper presents two systems for extracting the vocals from a musical piece. Vocals extraction fi...