TUT Rare Sound events 2017, development dataset consists of source files for creating mixtures of rare sound events (classes baby cry, gun shot, glass break) with background audio, as well a set of readily generated mixtures and recipes for generating them. The "source" part of the dataset consists of two subsets: background recordings from 15 different acoustic scenes, recordings with the target rare sound events from three classes, accompanied by annotations of their temporal occurrences, a set of meta files providing the cross-validation setup: lists of background and target event recordings split into training and test subsets (called "devtrain" and "devtest", respectively, indicating they are provided as the development dataset, ...
FSD-FS is a publicly-available database of human labelled sound events for few-shot learning. It spa...
NIGENS (Neural Information Processing group GENeral sounds) is a database provided for sound-related...
TUT Sound events 2017, development dataset consists of 24 audio recordings from a single acoustic sc...
TUT Rare Sound events 2017, evaluation dataset consists of source files for creating mixtures of rar...
Contains artificial sound mixes and meta data that were created for the task of sound event detectio...
Tampere University of Technology (TUT) Sound Events 2018 - Ambisonic, Anechoic, and Synthetic Impuls...
Tampere University of Technology (TUT) Sound Events 2018 - Ambisonic, Reverberant and Synthetic Impu...
TAU-SEBin Binaural Sound Events 2021 is a dataset of synthetic binaural audio recordings, which cons...
Tampere University of Technology (TUT) Sound Events 2018 - Circular array, Reverberant and Synthetic...
TUT Sound events 2016, development dataset consists of 22 audio recordings from two acoustic scenes:...
Tampere University of Technology (TUT) Sound Events 2018 - Circular array, Anechoic and Synthetic Im...
VOICe: A novel dataset for the development and evaluation of generalizable sound event detection dom...
Tampere University of Technology (TUT) Sound Events 2018 - Ambisonic, Reverberant and Real-life Impu...
The IDMT-DESED-FL and IDMT-URBAN-FL datasets enable research in sound event detection (SED) within a...
DESCRIPTION: This audio dataset serves serves as supplementary material for the DCASE2022 Challenge...
FSD-FS is a publicly-available database of human labelled sound events for few-shot learning. It spa...
NIGENS (Neural Information Processing group GENeral sounds) is a database provided for sound-related...
TUT Sound events 2017, development dataset consists of 24 audio recordings from a single acoustic sc...
TUT Rare Sound events 2017, evaluation dataset consists of source files for creating mixtures of rar...
Contains artificial sound mixes and meta data that were created for the task of sound event detectio...
Tampere University of Technology (TUT) Sound Events 2018 - Ambisonic, Anechoic, and Synthetic Impuls...
Tampere University of Technology (TUT) Sound Events 2018 - Ambisonic, Reverberant and Synthetic Impu...
TAU-SEBin Binaural Sound Events 2021 is a dataset of synthetic binaural audio recordings, which cons...
Tampere University of Technology (TUT) Sound Events 2018 - Circular array, Reverberant and Synthetic...
TUT Sound events 2016, development dataset consists of 22 audio recordings from two acoustic scenes:...
Tampere University of Technology (TUT) Sound Events 2018 - Circular array, Anechoic and Synthetic Im...
VOICe: A novel dataset for the development and evaluation of generalizable sound event detection dom...
Tampere University of Technology (TUT) Sound Events 2018 - Ambisonic, Reverberant and Real-life Impu...
The IDMT-DESED-FL and IDMT-URBAN-FL datasets enable research in sound event detection (SED) within a...
DESCRIPTION: This audio dataset serves serves as supplementary material for the DCASE2022 Challenge...
FSD-FS is a publicly-available database of human labelled sound events for few-shot learning. It spa...
NIGENS (Neural Information Processing group GENeral sounds) is a database provided for sound-related...
TUT Sound events 2017, development dataset consists of 24 audio recordings from a single acoustic sc...