Despite significant advancements in deep learning for vision and natural language, unsupervised domain adaptation in audio remains relatively unexplored. We, in part, attribute this to the lack of an appropriate benchmark dataset. To address this gap, we present Synthia's melody, a novel audio data generation framework capable of simulating an infinite variety of 4-second melodies with user-specified confounding structures characterised by musical keys, timbre, and loudness. Unlike existing datasets collected under observational settings, Synthia's melody is free of unobserved biases, ensuring the reproducibility and comparability of experiments. To showcase its utility, we generate two types of distribution shifts-domain shift and sample s...
This paper proposes a new benchmark task for generat-ing musical passages in the audio domain by usi...
This paper addresses the problem of domain adaptation for the task of music source separation. Using...
Comunicació presentada a: Workshop Machine Learning for Audio Signal Processing at NIPS 2017 (ML4Aud...
The ability to convey information using sound is critical for the survival of many vocal species, in...
Computer assisted music extensively relies on audio sample libraries and virtual instruments which p...
The advent of hyper-scale and general-purpose pre-trained models is shifting the paradigm of buildin...
Machine learning algorithms have achieved the state-of-the-art results by utilizing deep neural netw...
The performance of machine learning (ML) models is known to be affected by discrepancies between tra...
Distribution mismatches between the data seen at training and at application time remain a major cha...
Robust emotion recognition systems require extensive training by employing huge number of training s...
International audienceRecent progress in deep learning for audio synthesis opens the way to models t...
Audio classification plays a crucial role in speech and sound processing tasks with a wide range of ...
Music and speech exhibit striking similarities in the communication of emotions in the acoustic doma...
Pre-trained models are essential as feature extractors in modern machine learning systems in various...
We study the problem of source separation for music using deep learning with four known sources: dru...
This paper proposes a new benchmark task for generat-ing musical passages in the audio domain by usi...
This paper addresses the problem of domain adaptation for the task of music source separation. Using...
Comunicació presentada a: Workshop Machine Learning for Audio Signal Processing at NIPS 2017 (ML4Aud...
The ability to convey information using sound is critical for the survival of many vocal species, in...
Computer assisted music extensively relies on audio sample libraries and virtual instruments which p...
The advent of hyper-scale and general-purpose pre-trained models is shifting the paradigm of buildin...
Machine learning algorithms have achieved the state-of-the-art results by utilizing deep neural netw...
The performance of machine learning (ML) models is known to be affected by discrepancies between tra...
Distribution mismatches between the data seen at training and at application time remain a major cha...
Robust emotion recognition systems require extensive training by employing huge number of training s...
International audienceRecent progress in deep learning for audio synthesis opens the way to models t...
Audio classification plays a crucial role in speech and sound processing tasks with a wide range of ...
Music and speech exhibit striking similarities in the communication of emotions in the acoustic doma...
Pre-trained models are essential as feature extractors in modern machine learning systems in various...
We study the problem of source separation for music using deep learning with four known sources: dru...
This paper proposes a new benchmark task for generat-ing musical passages in the audio domain by usi...
This paper addresses the problem of domain adaptation for the task of music source separation. Using...
Comunicació presentada a: Workshop Machine Learning for Audio Signal Processing at NIPS 2017 (ML4Aud...