In this paper, we study the use of soft labels to train a system for sound event detection (SED). Soft labels can result from annotations which account for human uncertainty about categories, or emerge as a natural representation of multiple opinions in annotation. Converting annotations to hard labels results in unambiguous categories for training, at the cost of losing the details about the labels distribution. This work investigates how soft labels can be used, and what benefits they bring in training a SED system. The results show that the system is capable of learning information about the activity of the sounds which is reflected in the soft labels and is able to detect sounds that are missed in the typical binary target training setu...
The objective of this thesis is to develop novel classification and feature learning techniques for t...
Sound recognition systems aim to determine what source produced a sound event. Until now, such syste...
International audienceTraining a sound event detection algorithm on a heterogeneous dataset includin...
Strong labels are a necessity for evaluation of sound event detection methods, but often scarcely av...
Crowdsourcing is a popular tool for collecting large amounts of annotated data, but the specific for...
International audienceThis paper proposes an overview of the latest advances and challenges in sound...
Comunicació presentada a: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and ...
Everyday environments are overflowed with a wide variety of acoustic events, either produced by huma...
Sound event detection (SED) is a problem to detect the onset and offset times of sound events in an ...
The dataset was created for studying estimation of strong labels using crowdsourcing. It contains 2...
The objective of the thesis is to develop techniques that optimize the performances of sound event d...
Weakly labeled sound event detection (WSED) is an important task as it can facilitate the data colle...
This repo contains the pre-trained models, annotations, and additional results related to the paper ...
Weak labels are a recurring problem in the context of ambient sound analysis. While multiple methods...
Sound event detection (SED) aims to detect when and recognize what sound events happen in an audio c...
The objective of this thesis is to develop novel classification and feature learning techniques for t...
Sound recognition systems aim to determine what source produced a sound event. Until now, such syste...
International audienceTraining a sound event detection algorithm on a heterogeneous dataset includin...
Strong labels are a necessity for evaluation of sound event detection methods, but often scarcely av...
Crowdsourcing is a popular tool for collecting large amounts of annotated data, but the specific for...
International audienceThis paper proposes an overview of the latest advances and challenges in sound...
Comunicació presentada a: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and ...
Everyday environments are overflowed with a wide variety of acoustic events, either produced by huma...
Sound event detection (SED) is a problem to detect the onset and offset times of sound events in an ...
The dataset was created for studying estimation of strong labels using crowdsourcing. It contains 2...
The objective of the thesis is to develop techniques that optimize the performances of sound event d...
Weakly labeled sound event detection (WSED) is an important task as it can facilitate the data colle...
This repo contains the pre-trained models, annotations, and additional results related to the paper ...
Weak labels are a recurring problem in the context of ambient sound analysis. While multiple methods...
Sound event detection (SED) aims to detect when and recognize what sound events happen in an audio c...
The objective of this thesis is to develop novel classification and feature learning techniques for t...
Sound recognition systems aim to determine what source produced a sound event. Until now, such syste...
International audienceTraining a sound event detection algorithm on a heterogeneous dataset includin...