In this paper we present our audio tagging system for the DCASE 2019 Challenge Task 2. We propose a model consisting of a convolutional front end using log-mel-energies as input features, a recurrent neural network sequence encoder and a fully connected classifier network outputting an activity probability for each of the 80 considered event classes. Due to the recurrent neural network, which encodes a whole sequence into a single vector, our model is able to process sequences of varying lengths. The model is trained with only little manually labeled training data and a larger amount of automatically labeled web data, which hence suffers from label noise. To efficiently train the model with the provided data we use various data augmentation...
Audio tagging is the task of predicting the presence or absence of sound classes within an audio cli...
Comunicació presentada a: Workshop Machine Learning for Audio Signal Processing at NIPS 2017 (ML4Aud...
In this technique report, we present a bunch of methods for the task 4 of Detection and Classificati...
General-purpose audio tagging refers to classifying sounds that are of a diverse nature, and is rele...
This paper introduces Task 2 of the DCASE2019 Challenge, titled "Audio tagging with noisy labels and...
In this paper, we describe our system for the Task 2 of Detection and Classification of Acoustic Sce...
Audio tagging has attracted increasing attention since last decade and has various potential applic...
Audio tagging aims to perform multi-label classification on audio chunks and it is a newly proposed...
In this paper, we presented a neural network system for DCASE 2018 task 2, general purpose audio tag...
Audio tagging aims to detect the types of sound events occurring in an audio recording. To tag the p...
We present Music Tagging Transformer that is trained with a semi-supervised approach. The proposed m...
Label noise refers to the presence of inaccurate target labels in a dataset. It is an impediment to ...
In this paper, we present a gated convolutional neural network and a temporal attention-based locali...
Environmental audio tagging is a newly proposed task to predict the presence or absence of a specifi...
Label noise refers to the presence of inaccurate target labels in a dataset. It is an impediment to ...
Audio tagging is the task of predicting the presence or absence of sound classes within an audio cli...
Comunicació presentada a: Workshop Machine Learning for Audio Signal Processing at NIPS 2017 (ML4Aud...
In this technique report, we present a bunch of methods for the task 4 of Detection and Classificati...
General-purpose audio tagging refers to classifying sounds that are of a diverse nature, and is rele...
This paper introduces Task 2 of the DCASE2019 Challenge, titled "Audio tagging with noisy labels and...
In this paper, we describe our system for the Task 2 of Detection and Classification of Acoustic Sce...
Audio tagging has attracted increasing attention since last decade and has various potential applic...
Audio tagging aims to perform multi-label classification on audio chunks and it is a newly proposed...
In this paper, we presented a neural network system for DCASE 2018 task 2, general purpose audio tag...
Audio tagging aims to detect the types of sound events occurring in an audio recording. To tag the p...
We present Music Tagging Transformer that is trained with a semi-supervised approach. The proposed m...
Label noise refers to the presence of inaccurate target labels in a dataset. It is an impediment to ...
In this paper, we present a gated convolutional neural network and a temporal attention-based locali...
Environmental audio tagging is a newly proposed task to predict the presence or absence of a specifi...
Label noise refers to the presence of inaccurate target labels in a dataset. It is an impediment to ...
Audio tagging is the task of predicting the presence or absence of sound classes within an audio cli...
Comunicació presentada a: Workshop Machine Learning for Audio Signal Processing at NIPS 2017 (ML4Aud...
In this technique report, we present a bunch of methods for the task 4 of Detection and Classificati...