This paper explores the use of three different two-dimensional time-frequency features for audio event classification with deep neural network back-end classifiers. The evaluations use spectrogram, cochleogram and constant-Q transform based images for classification of 50 classes of audio events in varying levels of acoustic background noise, revealing interesting performance patterns with respect to noise level, feature image type and classifier. Evidence is obtained that two well-performing features, the spectrogram and cochleogram, make use of information that is potentially complementary in the input features. Feature fusion is thus explored for each pair of features, as well as for all tested features. Results indicate that a fusion of...
Institute of Engineering Sciences The file attached to this record is the author's final peer rev...
The remarkable success of deep convolutional neural networks in image-related applications has led t...
There are multiple sound events simultaneously occuring in a real-life audio recording collected e.g...
The automatic recognition of sound events by computers is an important aspect of emerging applicatio...
Traditional sound event recognition methods based on informative front end features such as MFCC, wi...
Recognizing acoustic events is an intricate problem for a machine and an emerging field of research....
This research examines an audio surveillance application, one of the many applications of sound eve...
The automatic detection and recognition of sound events by computers is a requirement for a number o...
Sound event detection is an extension of the static auditory classification task into continuous env...
Convolutional neural networks (CNN) have produced encouraging results in image classification tasks ...
The objective of this thesis is to develop novel classification and feature learning techniques for t...
We describe ongoing research in developing audio classification systems that use a spiking silicon c...
Hearing sense has an important role in our daily lives. During the recent years, there has been many...
This work proposes the use of pseudo-color cochleagram image of sound signals for feature extraction...
Audio classification, as a set of important and challenging tasks, groups speech signals according t...
Institute of Engineering Sciences The file attached to this record is the author's final peer rev...
The remarkable success of deep convolutional neural networks in image-related applications has led t...
There are multiple sound events simultaneously occuring in a real-life audio recording collected e.g...
The automatic recognition of sound events by computers is an important aspect of emerging applicatio...
Traditional sound event recognition methods based on informative front end features such as MFCC, wi...
Recognizing acoustic events is an intricate problem for a machine and an emerging field of research....
This research examines an audio surveillance application, one of the many applications of sound eve...
The automatic detection and recognition of sound events by computers is a requirement for a number o...
Sound event detection is an extension of the static auditory classification task into continuous env...
Convolutional neural networks (CNN) have produced encouraging results in image classification tasks ...
The objective of this thesis is to develop novel classification and feature learning techniques for t...
We describe ongoing research in developing audio classification systems that use a spiking silicon c...
Hearing sense has an important role in our daily lives. During the recent years, there has been many...
This work proposes the use of pseudo-color cochleagram image of sound signals for feature extraction...
Audio classification, as a set of important and challenging tasks, groups speech signals according t...
Institute of Engineering Sciences The file attached to this record is the author's final peer rev...
The remarkable success of deep convolutional neural networks in image-related applications has led t...
There are multiple sound events simultaneously occuring in a real-life audio recording collected e.g...