The bag-of-audio-words approach has been widely used for audio event recognition. In these models, a local fea-ture of an audio signal is matched to a code word according to a learned codebook. The signal is then represented by frequencies of the matched code words on the whole sig-nal. We present in this paper an improved model based on the idea of audio phrases which are sequences of multiple audio words. By using audio phrases, we are able to cap-ture the relationship between the isolated audio words and produce more semantic descriptors. Furthermore, we also propose an efficient approach to learn a compact codebook in a discriminative manner to deal with high-dimensionality of bag-of-audio-phrases representations. Experiments on the Fre...
The research domain of automatic sound event recognition aims to describe an audio signal in terms o...
Abstract Detecting complex events in videos is intrin-sically a multimodal problem since both audio ...
In this paper, we propose a generative model-based approach for audio-visual event classification. T...
The bag-of-audio-words approach has been widely used for audio event recognition. In these models, a...
In this paper we propose a novel approach for the audio-based detection of events. The approach adop...
In massive online audio databases such as Freesound, automatic methods to encode, process, compare a...
In this paper we present our audio based system for detecting "events" within consumer videos (e.g. ...
This paper presents a method for audio context recognition, meaning classification between everyday ...
In this paper we propose a novel method for the detection of audio events for surveillance applicati...
International audienceIn real audio data, frequently occurring patterns often convey relevant inform...
This paper describes a method for environmental audio events analysis. The audio events are modeled ...
Abstract Recently, sound recognition has been used to identify sounds, such as the sound of a car, o...
The human auditory system is very well matched to both hu-man speech and environmental sounds. There...
Sound analysis research has mainly been focused on speech and music processing. The deployed methodo...
This paper presents a system that uses symbolic representa-tions of audio concepts as words for the ...
The research domain of automatic sound event recognition aims to describe an audio signal in terms o...
Abstract Detecting complex events in videos is intrin-sically a multimodal problem since both audio ...
In this paper, we propose a generative model-based approach for audio-visual event classification. T...
The bag-of-audio-words approach has been widely used for audio event recognition. In these models, a...
In this paper we propose a novel approach for the audio-based detection of events. The approach adop...
In massive online audio databases such as Freesound, automatic methods to encode, process, compare a...
In this paper we present our audio based system for detecting "events" within consumer videos (e.g. ...
This paper presents a method for audio context recognition, meaning classification between everyday ...
In this paper we propose a novel method for the detection of audio events for surveillance applicati...
International audienceIn real audio data, frequently occurring patterns often convey relevant inform...
This paper describes a method for environmental audio events analysis. The audio events are modeled ...
Abstract Recently, sound recognition has been used to identify sounds, such as the sound of a car, o...
The human auditory system is very well matched to both hu-man speech and environmental sounds. There...
Sound analysis research has mainly been focused on speech and music processing. The deployed methodo...
This paper presents a system that uses symbolic representa-tions of audio concepts as words for the ...
The research domain of automatic sound event recognition aims to describe an audio signal in terms o...
Abstract Detecting complex events in videos is intrin-sically a multimodal problem since both audio ...
In this paper, we propose a generative model-based approach for audio-visual event classification. T...