Artificial sound event detection (SED) aims to mimic the human ability to perceive and understand what is happening in the surroundings. Deep learning now offers valuable techniques for this goal, such as convolutional neural networks (CNNs). The capsule neural network (CapsNet) architecture was recently introduced in the image processing field to overcome some known limitations of CNNs, specifically their limited robustness to affine transformations (i.e., perspective, size, orientation) and their difficulty detecting overlapping images. This motivated the authors to employ CapsNets for the polyphonic SED task, in which multiple sound events occur simultaneously. Specifically, we propose to expl...
We applied various architectures of deep neural networks for sound event detection and compared thei...
Polyphonic sound event localization and detection (SELD), which jointly performs sound event detecti...
Everyday environments are filled with a wide variety of acoustic events, either produced by huma...
Polyphonic sound event detection aims to detect the types of sound events that occur in given audio ...
The detection of acoustic scenes is a challenging problem in which environmental sound events must b...
In recent decades, surveillance and home security systems based on video analysis have been proposed...
Sound events often occur in unstructured environments where they exhibit wide variations in their fr...
The objective of this thesis is to investigate how a deep learning model called recurrent neural net...
The objective of this thesis is to develop novel classification and feature learning techniques for t...
There are multiple sound events simultaneously occurring in a real-life audio recording collected e.g...
Polyphonic sound event localization and detection is not only detecting what sound events are happen...
In this thesis, we present novel sound representations and classification methods for the task of so...
Detecting the class and the start and end times of sound events in real-world recordings is a challengi...
Sound event detection (SED) and localization refer to recognizing sound events and estimating their ...