Audio representation learning based on deep neural networks (DNNs) emerged as an alternative approach to hand-crafted features. For achieving high performance, DNNs often need a large amount of annotated data which can be difficult and costly to obtain. In this paper, we propose a method for learning audio representations, aligning the learned latent representations of audio and associated tags. Aligning is done by maximizing the agreement of the latent representations of audio and tags, using a contrastive loss. The result is an audio embedding model which reflects acoustic and semantic characteristics of sounds. We evaluate the quality of our embedding model, measuring its performance as a feature extractor on three different tasks (namel...
PhDIn this thesis, I present my hypothesis, experiment results, and discussion that are related to v...
Identifying musical instruments in a polyphonic music recording is a difficult yet crucial problem i...
The success of supervised deep learning methods is largely due to their ability to learn relevant fe...
Self-supervised audio representation learning offers an attractive alternative for obtaining generic...
Modeling various aspects that make a music piece unique is a challenging task, requiring the combina...
Modeling various aspects that make a music piece unique is a challenging task, requiring the combina...
Inspired by the recent progress in self-supervised learning for computer vision, in this paper we in...
Pre-trained models are essential as feature extractors in modern machine learning systems in various...
In the past years, several hybridization techniques have been proposed to synthesize novel audio con...
Deep learning has fueled an explosion of applications, yet training deep neural networks usually req...
Methods for extracting audio and speech features have been studied since pioneering work on spectrum...
This dataset consists of two hdf5 files that contain pre-computed log-mel spectrograms that have bee...
Automatic music and audio tagging can help increase the retrieval and re-use possibilities of many a...
In music domain, feature learning has been conducted mainly in two ways: unsupervised learning based...
En Music Information Retrieval (MIR, ou recherche d'information musicales) et en traitement de la pa...
PhDIn this thesis, I present my hypothesis, experiment results, and discussion that are related to v...
Identifying musical instruments in a polyphonic music recording is a difficult yet crucial problem i...
The success of supervised deep learning methods is largely due to their ability to learn relevant fe...
Self-supervised audio representation learning offers an attractive alternative for obtaining generic...
Modeling various aspects that make a music piece unique is a challenging task, requiring the combina...
Modeling various aspects that make a music piece unique is a challenging task, requiring the combina...
Inspired by the recent progress in self-supervised learning for computer vision, in this paper we in...
Pre-trained models are essential as feature extractors in modern machine learning systems in various...
In the past years, several hybridization techniques have been proposed to synthesize novel audio con...
Deep learning has fueled an explosion of applications, yet training deep neural networks usually req...
Methods for extracting audio and speech features have been studied since pioneering work on spectrum...
This dataset consists of two hdf5 files that contain pre-computed log-mel spectrograms that have bee...
Automatic music and audio tagging can help increase the retrieval and re-use possibilities of many a...
In music domain, feature learning has been conducted mainly in two ways: unsupervised learning based...
En Music Information Retrieval (MIR, ou recherche d'information musicales) et en traitement de la pa...
PhDIn this thesis, I present my hypothesis, experiment results, and discussion that are related to v...
Identifying musical instruments in a polyphonic music recording is a difficult yet crucial problem i...
The success of supervised deep learning methods is largely due to their ability to learn relevant fe...