This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2019Cataloged from student-submitted PDF version of thesis.Includes bibliographical references (pages 63-66).Many of the recent advances in audio event detection, particularly on the AudioSet dataset, have focused on improving performance using the released embeddings produced by a pre-trained model. In this work, we instead study the task of training a multi-label event classifier directly from the audio recordings of AudioSet. Using the audio recordings, not only are we able to repro...
Audio information retrieval has been a popular research subject over the last decades and being a su...
International audienceEach edition of the challenge on Detection and Classification of Acoustic Scen...
In this paper, we present a gated convolutional neural network and a temporal attention-based local...
As an important information carrier, sound carries abundant information about the environment, which...
As an important information carrier, sound carries abundant information about the environment, which...
There are multiple sound events simultaneously occuring in a real-life audio recording collected e.g...
Whether crossing the road or enjoying a concert, sound carries important information about the world...
In this thesis we investigate the use of deep neural networks applied to the field of computational a...
This work proposes bag-of-features deep learning models for acoustic scene classi?cation (ASC) – ide...
Everyday environments are overflowed with a wide variety of acoustic events, either produced by huma...
Research work on automatic speech recognition and automatic music transcription has been around for ...
Acoustic scene analysis (ASA) relies on the dynamic sensing and understanding of stationary and non-...
Each edition of the challenge on Detection and Classification of Acoustic Scenes and Events (DCASE) ...
We study the merit of transfer learning for two sound recognition problems, i.e., audio tagging and ...
In this paper, we present a method called HODGEPODGE\footnotemark[1] for large-scale detection of so...
Audio information retrieval has been a popular research subject over the last decades and being a su...
International audienceEach edition of the challenge on Detection and Classification of Acoustic Scen...
In this paper, we present a gated convolutional neural network and a temporal attention-based local...
As an important information carrier, sound carries abundant information about the environment, which...
As an important information carrier, sound carries abundant information about the environment, which...
There are multiple sound events simultaneously occuring in a real-life audio recording collected e.g...
Whether crossing the road or enjoying a concert, sound carries important information about the world...
In this thesis we investigate the use of deep neural networks applied to the field of computational a...
This work proposes bag-of-features deep learning models for acoustic scene classi?cation (ASC) – ide...
Everyday environments are overflowed with a wide variety of acoustic events, either produced by huma...
Research work on automatic speech recognition and automatic music transcription has been around for ...
Acoustic scene analysis (ASA) relies on the dynamic sensing and understanding of stationary and non-...
Each edition of the challenge on Detection and Classification of Acoustic Scenes and Events (DCASE) ...
We study the merit of transfer learning for two sound recognition problems, i.e., audio tagging and ...
In this paper, we present a method called HODGEPODGE\footnotemark[1] for large-scale detection of so...
Audio information retrieval has been a popular research subject over the last decades and being a su...
International audienceEach edition of the challenge on Detection and Classification of Acoustic Scen...
In this paper, we present a gated convolutional neural network and a temporal attention-based local...