Audio surveillance has attracted wide interest in recent years, owing to the large number of situations in which such systems can be deployed, either alone or in combination with video-based algorithms. In this paper we propose a deep learning method to automatically recognize events of interest in the context of audio surveillance, namely screams, breaking glass, and gunshots. The audio stream is represented as a gammatonegram image, and sections of this representation are fed to a 21-layer CNN whose output units correspond to the event classes. We trained the CNN, called AReN, by taking advantage of a problem-driven data augmentation, which extends the training dataset with gammatonegra...
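For readers unfamiliar with the representation, a gammatonegram is analogous to a spectrogram, but uses a bank of gammatone filters spaced on the ERB-rate scale instead of linearly spaced FFT bins. The sketch below is a minimal, illustrative NumPy/SciPy implementation (function name, parameter values, and defaults are our own assumptions, not the configuration used by AReN):

```python
import numpy as np
from scipy.signal import fftconvolve

def gammatonegram(x, sr, n_bands=64, f_lo=50.0, f_hi=8000.0,
                  win=0.025, hop=0.010):
    """Log-energy gammatonegram: 4th-order gammatone filterbank
    followed by short-time energy in overlapping frames."""
    # Centre frequencies equally spaced on the ERB-rate scale
    erb = lambda f: 21.4 * np.log10(1 + 0.00437 * f)
    inv = lambda e: (10 ** (e / 21.4) - 1) / 0.00437
    cfs = inv(np.linspace(erb(f_lo), erb(f_hi), n_bands))

    t = np.arange(int(0.05 * sr)) / sr        # 50 ms impulse responses
    frame, step = int(win * sr), int(hop * sr)
    n_frames = 1 + (len(x) - frame) // step
    G = np.empty((n_bands, n_frames))
    for i, fc in enumerate(cfs):
        # Equivalent rectangular bandwidth of the band at fc
        b = 1.019 * 24.7 * (1 + 4.37 * fc / 1000)
        ir = t ** 3 * np.exp(-2 * np.pi * b * t) * np.cos(2 * np.pi * fc * t)
        y = fftconvolve(x, ir / np.linalg.norm(ir), mode="same")
        for j in range(n_frames):
            G[i, j] = np.sum(y[j * step: j * step + frame] ** 2)
    return np.log(G + 1e-10)
```

Fixed-size sections of the resulting (bands × frames) image could then be cropped and passed to the CNN, in the same spirit as the pipeline described above.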