In this paper, we present a gated convolutional neural network and a temporal attention-based localization method for audio classification, which won the 1st place in the large-scale weakly supervised sound event detection task of Detection and Classification of Acoustic Scenes and Events (DCASE) 2017 challenge. The audio clips in this task, which are extracted from YouTube videos, are manually labelled with one or more audio tags, but without time stamps of the audio events, hence referred to as weakly labelled data. Two subtasks are defined in this challenge including audio tagging and sound event detection using this weakly labelled data. We propose a convolutional recurrent neural network (CRNN) with learnable gated linear units (GLUs) ...
Environmental audio tagging is a newly proposed task to predict the presence or absence of a specifi...
International audienceThe design of new methods and models when only weakly-labeled data are availab...
Audio tagging aims to detect the types of sound events occurring in an audio recording. To tag the p...
In this paper, we present a gated convolutional neural network and a temporal attention-based local...
In this technique report, we present a bunch of methods for the task 4 of Detection and Classificati...
Audio tagging aims to perform multi-label classification on audio chunks and it is a newly proposed ...
International audienceIn this paper, we address the detection of audio events in domestic environmen...
Sound event detection (SED) is a task to detect sound events in an audio recording. One challenge of...
Multiple instance learning (MIL) with convolutional neural networks (CNNs) has been proposed recentl...
Sound event detection (SED) is a problem to detect the onset and offset times of sound events in an ...
Sound event detection (SED) aims to detect when and recognize what sound events happen in an audio c...
In this paper, we present a method called HODGEPODGE\footnotemark[1] for large-scale detection of so...
In this paper, we propose a multi-level attention model for the weakly labelled audio classification...
Audio tagging is the task of predicting the presence or absence of sound classes within an audio cli...
Weakly labeled sound event detection (WSED) is an important task as it can facilitate the data colle...
Environmental audio tagging is a newly proposed task to predict the presence or absence of a specifi...
International audienceThe design of new methods and models when only weakly-labeled data are availab...
Audio tagging aims to detect the types of sound events occurring in an audio recording. To tag the p...
In this paper, we present a gated convolutional neural network and a temporal attention-based local...
In this technique report, we present a bunch of methods for the task 4 of Detection and Classificati...
Audio tagging aims to perform multi-label classification on audio chunks and it is a newly proposed ...
International audienceIn this paper, we address the detection of audio events in domestic environmen...
Sound event detection (SED) is a task to detect sound events in an audio recording. One challenge of...
Multiple instance learning (MIL) with convolutional neural networks (CNNs) has been proposed recentl...
Sound event detection (SED) is a problem to detect the onset and offset times of sound events in an ...
Sound event detection (SED) aims to detect when and recognize what sound events happen in an audio c...
In this paper, we present a method called HODGEPODGE\footnotemark[1] for large-scale detection of so...
In this paper, we propose a multi-level attention model for the weakly labelled audio classification...
Audio tagging is the task of predicting the presence or absence of sound classes within an audio cli...
Weakly labeled sound event detection (WSED) is an important task as it can facilitate the data colle...
Environmental audio tagging is a newly proposed task to predict the presence or absence of a specifi...
International audienceThe design of new methods and models when only weakly-labeled data are availab...
Audio tagging aims to detect the types of sound events occurring in an audio recording. To tag the p...