State-of-the-art singing voice separation is based on deep learning making use of CNN structures with skip connections (like U-net model, Wave-U-Net model, or MSDENSELSTM). A key to the success of these models is the availability of a large amount of training data. In the following study, we are interested in singing voice separation for mono signals and will investigate into comparing the U-Net and the Wave-U-Net that are structurally similar, but work on different input representations. First, we report a few results on variations of the U-Net model. Second, we will discuss the potential of state of the art speech and music transformation algorithms for augmentation of existing data sets and demonstrate that the effect of these augmentati...
Singing voice separation based on deep learning relies on the usage of time-frequency masking. In ma...
Choral singing is a widely practiced form of ensemble singing wherein a group of people sing simulta...
The objective of deep learning methods based on encoder-decoder architectures for music source separ...
Monaural singing voice separation has received much attention in recent years. In this paper, we pro...
Singing Voice Separation (SVS) tries to separate singing voice from a given mixed musical signal. Re...
Informed source separation has recently gained renewed interest with the introduction of neural netw...
Monaural source separation is a challenging issue due to the fact that there is only a single channe...
Automatic sung speech recognition is a challenging problem that remains largely unsolved. Challenges...
The decomposition of a music audio signal into its vocal and backing track components is analogous t...
Models for audio source separation usually operate on the magnitude spectrum, which ignores phase in...
Audio Source Separation concerns the field of study, where the general aim is to isolate the sources...
Notable progress in music source separation has been achieved using multi-branch networks that opera...
Deep neural networks have become a cornerstone in various recognition and classification tasks due t...
State-of-the-art methods for monaural singing voice separation consist in estimating the magnitude s...
Vocal source separation and fundamental frequency estimation in music are tightly related tasks. The...
Singing voice separation based on deep learning relies on the usage of time-frequency masking. In ma...
Choral singing is a widely practiced form of ensemble singing wherein a group of people sing simulta...
The objective of deep learning methods based on encoder-decoder architectures for music source separ...
Monaural singing voice separation has received much attention in recent years. In this paper, we pro...
Singing Voice Separation (SVS) tries to separate singing voice from a given mixed musical signal. Re...
Informed source separation has recently gained renewed interest with the introduction of neural netw...
Monaural source separation is a challenging issue due to the fact that there is only a single channe...
Automatic sung speech recognition is a challenging problem that remains largely unsolved. Challenges...
The decomposition of a music audio signal into its vocal and backing track components is analogous t...
Models for audio source separation usually operate on the magnitude spectrum, which ignores phase in...
Audio Source Separation concerns the field of study, where the general aim is to isolate the sources...
Notable progress in music source separation has been achieved using multi-branch networks that opera...
Deep neural networks have become a cornerstone in various recognition and classification tasks due t...
State-of-the-art methods for monaural singing voice separation consist in estimating the magnitude s...
Vocal source separation and fundamental frequency estimation in music are tightly related tasks. The...
Singing voice separation based on deep learning relies on the usage of time-frequency masking. In ma...
Choral singing is a widely practiced form of ensemble singing wherein a group of people sing simulta...
The objective of deep learning methods based on encoder-decoder architectures for music source separ...