The majority of the recent works that address the interpretability of raw waveform based deep neural networks (DNNs) for audio processing focus on interpreting spectral and frequency response information, often limiting to visual and signal theoretic means of interpretation, solely for the first layer. This work proposes sonification, a method to interpret intermediate feature representations of sound event recognition (SER) 1D-convolutional neural networks (1D-CNNs) trained on raw waveforms by mapping these representations back into the discrete-time input signal domain, highlighting substructures in the input that maximally activate a feature map as intelligible acoustic events. Sonification is used to compare supervised and contrastive s...
Deep learning models have improved cutting-edge technologies in many research areas, but their black...
Audio classification, as a set of important and challenging tasks, groups speech signals according t...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
The main objective of this work is to investigate how a deep convolutional neural network (CNN) perf...
Machine hearing of the environmental sound is one of the important issues in the audio recognition d...
Deep neural networks have been recently shown to capture intricate information transformation of sig...
As an important information carrier, sound carries abundant information about the environment, which...
The rise of deep learning algorithms has led many researchers to withdraw from using classic signal ...
Recognizing acoustic events is an intricate problem for a machine and an emerging field of research....
There are multiple sound events simultaneously occuring in a real-life audio recording collected e.g...
Deep learning models have improved cutting-edge technologies in many research areas, but their black...
Recently, labeling of acoustic events has emerged as an active topic covering a wide range of applic...
In this thesis we investigate the use of deep neural networks applied to the field of computational a...
The objective of this thesis is to develop novel classification and feature learning techniques for t...
Speech emotion recognition is a challenging task in speech processing field. For this reason, featur...
Deep learning models have improved cutting-edge technologies in many research areas, but their black...
Audio classification, as a set of important and challenging tasks, groups speech signals according t...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
The main objective of this work is to investigate how a deep convolutional neural network (CNN) perf...
Machine hearing of the environmental sound is one of the important issues in the audio recognition d...
Deep neural networks have been recently shown to capture intricate information transformation of sig...
As an important information carrier, sound carries abundant information about the environment, which...
The rise of deep learning algorithms has led many researchers to withdraw from using classic signal ...
Recognizing acoustic events is an intricate problem for a machine and an emerging field of research....
There are multiple sound events simultaneously occuring in a real-life audio recording collected e.g...
Deep learning models have improved cutting-edge technologies in many research areas, but their black...
Recently, labeling of acoustic events has emerged as an active topic covering a wide range of applic...
In this thesis we investigate the use of deep neural networks applied to the field of computational a...
The objective of this thesis is to develop novel classification and feature learning techniques for t...
Speech emotion recognition is a challenging task in speech processing field. For this reason, featur...
Deep learning models have improved cutting-edge technologies in many research areas, but their black...
Audio classification, as a set of important and challenging tasks, groups speech signals according t...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...