This paper revisits the idea of music representation learning supervised by editorial metadata, contributing to the state of the art in two ways. First, we exploit the public editorial metadata available on Discogs, an extensive community-maintained music database containing information about artists, releases, and record labels. Second, we use a contrastive learning setup based on COLA, different from previous systems based on triplet loss. We train models targeting several associations derived from the metadata and experiment with stacked combinations of learned representations, evaluating them on standard music classification tasks. Additionally, we consider learning all the associations jointly in a multi-task setup. We show that it is ...
We demonstrate that language models pre-trained on codified (discretely-encoded) music audio learn r...
Deep representation learning offers a powerful paradigm for mapping input data onto an organized emb...
Music streaming platforms rely heavily on learning meaningful representations of tracks to surface a...
This work has been accepted at the 23rd International Society for Music Information Retrieval Confer...
In music domain, feature learning has been conducted mainly in two ways: unsupervised learning based...
While deep learning has enabled great advances in many areas of music, labeled music datasets remain...
In this thesis, we address the problems of classifying and recommending music present in large colle...
Modeling various aspects that make a music piece unique is a challenging task, requiring the combina...
Modeling various aspects that make a music piece unique is a challenging task, requiring the combina...
In this work, we provide a broad comparative analysis of strategies for pre-training audio understan...
In this thesis, we address the problems of classifying and recommending music present in large colle...
Comunicació presentada a: ISMIR 2017, celebrat a Suzhou, Xina, del 23 al 27 d'octubre de 2017While a...
In this work we propose a set of new automatic text augmentations that leverage Large Language Model...
Abstract. In this work we propose a novel approach to music recommendation based exclusively on edit...
Inspired by the success of deploying deep learning in the fields of Computer Vision and Natural Lang...
We demonstrate that language models pre-trained on codified (discretely-encoded) music audio learn r...
Deep representation learning offers a powerful paradigm for mapping input data onto an organized emb...
Music streaming platforms rely heavily on learning meaningful representations of tracks to surface a...
This work has been accepted at the 23rd International Society for Music Information Retrieval Confer...
In music domain, feature learning has been conducted mainly in two ways: unsupervised learning based...
While deep learning has enabled great advances in many areas of music, labeled music datasets remain...
In this thesis, we address the problems of classifying and recommending music present in large colle...
Modeling various aspects that make a music piece unique is a challenging task, requiring the combina...
Modeling various aspects that make a music piece unique is a challenging task, requiring the combina...
In this work, we provide a broad comparative analysis of strategies for pre-training audio understan...
In this thesis, we address the problems of classifying and recommending music present in large colle...
Comunicació presentada a: ISMIR 2017, celebrat a Suzhou, Xina, del 23 al 27 d'octubre de 2017While a...
In this work we propose a set of new automatic text augmentations that leverage Large Language Model...
Abstract. In this work we propose a novel approach to music recommendation based exclusively on edit...
Inspired by the success of deploying deep learning in the fields of Computer Vision and Natural Lang...
We demonstrate that language models pre-trained on codified (discretely-encoded) music audio learn r...
Deep representation learning offers a powerful paradigm for mapping input data onto an organized emb...
Music streaming platforms rely heavily on learning meaningful representations of tracks to surface a...