Supervised machine learning relies on the accessibility of large datasets of annotated data. This is essential since small datasets generally lead to overfitting when training high-dimensional machine-learning models. Since the manual annotation of such large datasets is a long, tedious and expensive process, another possibility is to artificially increase the size of the dataset. This is known as data augmentation. In this paper we provide an in-depth analysis of two data augmentation methods: sound transformations and sound segmentation. The first transforms a music track to a set of new music tracks by applying processes such as pitch-shifting, time-stretching or filtering. The second one splits a long sound signal into a set of shorter ...
Genre provides one of the most convenient categorizations of music, but it is often regarded as a po...
Identifying musical instruments in a polyphonic music recording is a difficult yet crucial problem i...
Everyday environments are overflowed with a wide variety of acoustic events, either produced by huma...
Predictive models for music annotation tasks are practi-cally limited by a paucity of well-annotated...
Comunicació presentada a la 22a International Conference on Digital Audio Effects (DAFx-19) que se c...
There exists a large number of supervised music classification tasks: Recognition of music genres an...
Comunicació presentada a la 22a International Conference on Digital Audio Effects (DAFx-19) que se c...
Identifying musical instruments in a polyphonic music recording is a difficult yet crucial problem i...
In computer vision, state-of-the-art object recognition sys-tems rely on label-preserving image tran...
In computer vision, state-of-the-art object recognition sys-tems rely on label-preserving image tran...
Data cleansing is a well studied strategy for cleaning erroneous labels in datasets, which has not y...
abstract: Modern audio datasets and machine learning software tools have given researchers a deep un...
The aim of this systematic literature review (SLR) is to identify and critically evaluate current re...
In this work, we provide a broad comparative analysis of strategies for pre-training audio understan...
Deep learning models have recently led to significant improvements in a wide variety of tasks. Known...
Genre provides one of the most convenient categorizations of music, but it is often regarded as a po...
Identifying musical instruments in a polyphonic music recording is a difficult yet crucial problem i...
Everyday environments are overflowed with a wide variety of acoustic events, either produced by huma...
Predictive models for music annotation tasks are practi-cally limited by a paucity of well-annotated...
Comunicació presentada a la 22a International Conference on Digital Audio Effects (DAFx-19) que se c...
There exists a large number of supervised music classification tasks: Recognition of music genres an...
Comunicació presentada a la 22a International Conference on Digital Audio Effects (DAFx-19) que se c...
Identifying musical instruments in a polyphonic music recording is a difficult yet crucial problem i...
In computer vision, state-of-the-art object recognition sys-tems rely on label-preserving image tran...
In computer vision, state-of-the-art object recognition sys-tems rely on label-preserving image tran...
Data cleansing is a well studied strategy for cleaning erroneous labels in datasets, which has not y...
abstract: Modern audio datasets and machine learning software tools have given researchers a deep un...
The aim of this systematic literature review (SLR) is to identify and critically evaluate current re...
In this work, we provide a broad comparative analysis of strategies for pre-training audio understan...
Deep learning models have recently led to significant improvements in a wide variety of tasks. Known...
Genre provides one of the most convenient categorizations of music, but it is often regarded as a po...
Identifying musical instruments in a polyphonic music recording is a difficult yet crucial problem i...
Everyday environments are overflowed with a wide variety of acoustic events, either produced by huma...