Convolutional neural networks (CNN) have produced encouraging results in image classification tasks and have been increasingly adopted in audio classification applications. However, in using CNN for acoustic event recognition, the first hurdle is finding the best image representation of an audio signal. In this work, we evaluate the performance of four time-frequency representations for use with CNN. Firstly, we consider the conventional spectrogram image. Secondly, we apply moving average to the spectrogram along the frequency domain to obtain what we refer as the smoothed spectrogram. Thirdly, we use the mel-spectrogram which utilizes the mel-filter, as in mel-frequency cepstral coefficients. Finally, we propose the use of a cochleagram i...
Sound event recognition systems are rapidly becoming part of our life, since they can be profitably ...
As an important information carrier, sound carries abundant information about the environment, which...
The objective of this thesis is to develop novel classification and feature learning techniques for t...
Convolutional neural networks (CNN) are being increasingly used for audio signal classification appl...
This work proposes the use of pseudo-color cochleagram image of sound signals for feature extraction...
This paper presents a novel application of convolutional neural networks (CNNs) for the task of acou...
Convolutional Neural Networks (CNNs) have enabled significant improvements across a number of applic...
Sound events often occur in unstructured environments where they exhibit wide variations in their fr...
Research has shown the efficacy of using convolutional neural networks (CNN) with audio spectrograms...
In this paper, we present the details of our proposed framework and solution for the DCASE 2019 Task...
This paper explores the use of three different two-dimensional time-frequency features for audio eve...
Traditional sound event recognition methods based on informative front end features such as MFCC, wi...
Many real-world time series analysis problems are characterized by low signal-to-noise ratios and co...
In this thesis I apply a convolutional neural network to classify a large set of publicly available ...
We propose a convolutional neural network (CNN) model based on an attention pooling method to classi...
Sound event recognition systems are rapidly becoming part of our life, since they can be profitably ...
As an important information carrier, sound carries abundant information about the environment, which...
The objective of this thesis is to develop novel classification and feature learning techniques for t...
Convolutional neural networks (CNN) are being increasingly used for audio signal classification appl...
This work proposes the use of pseudo-color cochleagram image of sound signals for feature extraction...
This paper presents a novel application of convolutional neural networks (CNNs) for the task of acou...
Convolutional Neural Networks (CNNs) have enabled significant improvements across a number of applic...
Sound events often occur in unstructured environments where they exhibit wide variations in their fr...
Research has shown the efficacy of using convolutional neural networks (CNN) with audio spectrograms...
In this paper, we present the details of our proposed framework and solution for the DCASE 2019 Task...
This paper explores the use of three different two-dimensional time-frequency features for audio eve...
Traditional sound event recognition methods based on informative front end features such as MFCC, wi...
Many real-world time series analysis problems are characterized by low signal-to-noise ratios and co...
In this thesis I apply a convolutional neural network to classify a large set of publicly available ...
We propose a convolutional neural network (CNN) model based on an attention pooling method to classi...
Sound event recognition systems are rapidly becoming part of our life, since they can be profitably ...
As an important information carrier, sound carries abundant information about the environment, which...
The objective of this thesis is to develop novel classification and feature learning techniques for t...