In this paper, we propose a stacked convolutional and recurrent neural network (CRNN) with a 3D convolutional neural network (CNN) in the first layer for the multichannel sound event detection (SED) task. The 3D CNN enables the network to simultaneously learn the inter-and intra-channel features from the input multichannel audio. In order to evaluate the proposed method, multichannel audio datasets with different number of overlapping sound sources are synthesized. Each of this dataset has a four-channel first-order Ambisonic, binaural, and single-channel versions, on which the performance of SED using the proposed method are compared to study the potential of SED using multichannel audio. A similar study is also done with the binaural and ...
The objective of this thesis is to develop novel classification and feature learning techniques for t...
In real-life audio scenes, many sound events from differ-ent sources are simultaneously active, whic...
Automated analysis of complex scenes of everyday sounds might help us navigate within the enormous a...
In this paper, we propose a stacked convolutional and recurrent neural network (CRNN) with a 3D conv...
| openaire: EC/H2020/637422/EU//EVERYSOUNDIn this paper, we propose a convolutional recurrent neural...
This paper proposes to use low-level spatial features extracted from multichannel audio for sound ev...
Recently, deep recurrent neural networks have achieved great success in various machine learning tas...
There are multiple sound events simultaneously occuring in a real-life audio recording collected e.g...
Sound events often occur in unstructured environments where they exhibit wide variations in their fr...
Sound event localization and detection (SELD) refers to the problem of identifying the presence of i...
This paper proposes sound event localization and detection methods from multichannel recording. The ...
We propose a multi-label multi-task framework based on a convolutional recurrent neural network to u...
Sound event detection (SED) has been widely applied in real world applications. Convolutional recurr...
In this thesis, we present novel sound representations and classification methods for the task of so...
The objective of this thesis is to investigate how a deep learning model called recurrent neural net...
The objective of this thesis is to develop novel classification and feature learning techniques for t...
In real-life audio scenes, many sound events from differ-ent sources are simultaneously active, whic...
Automated analysis of complex scenes of everyday sounds might help us navigate within the enormous a...
In this paper, we propose a stacked convolutional and recurrent neural network (CRNN) with a 3D conv...
| openaire: EC/H2020/637422/EU//EVERYSOUNDIn this paper, we propose a convolutional recurrent neural...
This paper proposes to use low-level spatial features extracted from multichannel audio for sound ev...
Recently, deep recurrent neural networks have achieved great success in various machine learning tas...
There are multiple sound events simultaneously occuring in a real-life audio recording collected e.g...
Sound events often occur in unstructured environments where they exhibit wide variations in their fr...
Sound event localization and detection (SELD) refers to the problem of identifying the presence of i...
This paper proposes sound event localization and detection methods from multichannel recording. The ...
We propose a multi-label multi-task framework based on a convolutional recurrent neural network to u...
Sound event detection (SED) has been widely applied in real world applications. Convolutional recurr...
In this thesis, we present novel sound representations and classification methods for the task of so...
The objective of this thesis is to investigate how a deep learning model called recurrent neural net...
The objective of this thesis is to develop novel classification and feature learning techniques for t...
In real-life audio scenes, many sound events from differ-ent sources are simultaneously active, whic...
Automated analysis of complex scenes of everyday sounds might help us navigate within the enormous a...