Disentangling factors of variation aims to uncover latent variables that underlie the process of data generation. In this paper, we propose a framework that achieves unsupervised pitch and timbre disentanglement for isolated musical instrument sounds without relying on data annotations or pre-trained neural networks. Our framework, based on variational auto-encoders, takes as input a spectral frame, and encodes pitch and timbre as categorical and continuous variables, respectively. The input is then reconstructed by combining those variables. Under an unsupervised training setting, a major challenge is that encoders are tasked to capture factors of interest with distinct latent representations, without access to the corresponding ground-tru...
This paper presents a convolutional neural network (CNN) that uses input from a polyphonic pitch est...
International audienceAlthough extensively studied for many years, defining the timbre of musical so...
FM Synthesis is a well-known algorithm used to generate complex timbre from a compact set of design ...
In this paper, we learn disentangled representations of timbre and pitch for musical instrument soun...
We propose a unified model for three inter-related tasks: 1) to separate individual sound sources fr...
Controllability, despite being a much-desired property of a generative model, remains an ill-defined...
Automatic music generation is an attractive topic in the interdisciplinary field of music and comput...
Improving controllability or the ability to manipulate one or more attributes of the generated data ...
Analogy-making is a key method for computer algorithms to generate both natural and creative music p...
Automatic music generation is an interdisciplinary research topic that combines computational creati...
Variational Autoencoder has already achieved great results on image generation and recently made pro...
Generative models aim to understand the properties of data, through the construction of latent space...
Abstract — Source separation of musical signals is an appealing but difficult problem, especially in...
We introduce a machine learning technique to autonomously generate novel melodies that are variation...
International audienceMultiple pitch estimation consists of estimating the fundamental frequencies a...
This paper presents a convolutional neural network (CNN) that uses input from a polyphonic pitch est...
International audienceAlthough extensively studied for many years, defining the timbre of musical so...
FM Synthesis is a well-known algorithm used to generate complex timbre from a compact set of design ...
In this paper, we learn disentangled representations of timbre and pitch for musical instrument soun...
We propose a unified model for three inter-related tasks: 1) to separate individual sound sources fr...
Controllability, despite being a much-desired property of a generative model, remains an ill-defined...
Automatic music generation is an attractive topic in the interdisciplinary field of music and comput...
Improving controllability or the ability to manipulate one or more attributes of the generated data ...
Analogy-making is a key method for computer algorithms to generate both natural and creative music p...
Automatic music generation is an interdisciplinary research topic that combines computational creati...
Variational Autoencoder has already achieved great results on image generation and recently made pro...
Generative models aim to understand the properties of data, through the construction of latent space...
Abstract — Source separation of musical signals is an appealing but difficult problem, especially in...
We introduce a machine learning technique to autonomously generate novel melodies that are variation...
International audienceMultiple pitch estimation consists of estimating the fundamental frequencies a...
This paper presents a convolutional neural network (CNN) that uses input from a polyphonic pitch est...
International audienceAlthough extensively studied for many years, defining the timbre of musical so...
FM Synthesis is a well-known algorithm used to generate complex timbre from a compact set of design ...