In this work, we adopt an information theoretic approach- the Information Bottleneck method- to extract the relevant spectro-temporal modulations for the task of speech / non-speech dis-crimination- non-speech events include music, noise and an-imal vocalizations. A compact representation (a “cluster pro-totype”) is built for each class consisting of the maximally in-formative features with respect to the classification task. We assess the similarity of a sound to each representative cluster using the spectro-temporal modulation index (STMI) adapted to handle the contribution of different frequency bands. A sim-ple threshold check is then used for discriminating speech from non-speech events. Conducted experiments have shown that the propos...
This paper proposes a security-monitoring instrument that can detect and classify the location and n...
omer La. tudent.unsw.edu.au 1 ambi,(a,unsw.edu.aU tacmaeJIbD C u ienesLdni Abstract- Speech and musi...
This paper describes a proposed algorithm for speech/music discrimination, which works on data direc...
In this work, we adopt an information theoretic approach- the Information Bottleneck method- to extr...
We describe a content-based audio classification algorithm based on novel multiscale spectro-tempora...
We describe a content-based audio classification algorithm based on novel multiscale spectro-tempora...
A novel approach for content based audio classification is presented based on multiscale spectro-tem...
This thesis addresses the problem of classifying an audio stream as either speech or music, an issue...
A classification method is presented that detects the presence of speech embedded in a real acoustic...
Abstract: This paper describes a proposed algorithm for speech/music discrimination, which works on ...
This work is devoted to the problem of automatic speech and music discrim-ination. As we will see he...
In this paper, we analyze the temporal modulation char-acteristics of speech and noise from a speech...
Driven by the demand of information retrieval, video editing and human-computer interface, in this p...
International audienceMost of speech/music discrimination techniques proposed in the literature need...
The human auditory system is very well matched to both hu-man speech and environmental sounds. There...
This paper proposes a security-monitoring instrument that can detect and classify the location and n...
omer La. tudent.unsw.edu.au 1 ambi,(a,unsw.edu.aU tacmaeJIbD C u ienesLdni Abstract- Speech and musi...
This paper describes a proposed algorithm for speech/music discrimination, which works on data direc...
In this work, we adopt an information theoretic approach- the Information Bottleneck method- to extr...
We describe a content-based audio classification algorithm based on novel multiscale spectro-tempora...
We describe a content-based audio classification algorithm based on novel multiscale spectro-tempora...
A novel approach for content based audio classification is presented based on multiscale spectro-tem...
This thesis addresses the problem of classifying an audio stream as either speech or music, an issue...
A classification method is presented that detects the presence of speech embedded in a real acoustic...
Abstract: This paper describes a proposed algorithm for speech/music discrimination, which works on ...
This work is devoted to the problem of automatic speech and music discrim-ination. As we will see he...
In this paper, we analyze the temporal modulation char-acteristics of speech and noise from a speech...
Driven by the demand of information retrieval, video editing and human-computer interface, in this p...
International audienceMost of speech/music discrimination techniques proposed in the literature need...
The human auditory system is very well matched to both hu-man speech and environmental sounds. There...
This paper proposes a security-monitoring instrument that can detect and classify the location and n...
omer La. tudent.unsw.edu.au 1 ambi,(a,unsw.edu.aU tacmaeJIbD C u ienesLdni Abstract- Speech and musi...
This paper describes a proposed algorithm for speech/music discrimination, which works on data direc...