We propose a new type of audio feature (HFCC-ENS) as well as an unsupervised method for detecting short sequences of spoken words (key-phrases) within long speech recordings. Our technical contributions are threefold: Firstly, we propose to use bandwidth-adapted filterbanks instead of classical MFCC-style filters in the feature extraction step. Secondly, the time resolution of the resulting features is adapted to account for the temporal characteristics of the spoken phrases. Thirdly, the key-phrase detection step is performed by matching sequences of the resulting HFCC-ENS features with features extracted from a target speech recording. We evaluate the proposed method using the German Kiel Corpus and furthermore investigate speech-related ...
Keyword Spotting (KWS) systems allow detecting a set of spoken (pre-defined) keywords. Open-vocabula...
International audienceThis paper describes and evaluates a computational architecture to discover an...
We present a novel approach to speech processing based on the principle of pattern discovery. Our wo...
This work investigates into methods for words, word phrases and longer segments detection in large s...
International audienceIn real audio data, frequently occurring patterns often convey relevant inform...
The bag-of-audio-words approach has been widely used for audio event recognition. In these models, a...
This thesis introduces a novel algorithm for phrase spotting and shows the advantages of using large...
In this article we propose two algorithms for discourse prosodic feature interpretation. The first a...
Modern devices are more frequently equipped with voice control. Using speech to operate is a natural...
This thesis presents work in research topics of audio detection. It first describes a system for lar...
The goal of this work is to automatically determine whether and when a word of interest is spoken by...
In this paper, we propose an A*-admissible key-phrase spotting framework, which needs little domain ...
Millions of user generated video blogs expressing opinions and feelings about products, events, news...
This thesis explores unsupervised algorithms for pattern discovery and retrieval in audio and speech...
This paper assesses the role of robust acoustic features in spoken term detection (a.k.a keyword spo...
Keyword Spotting (KWS) systems allow detecting a set of spoken (pre-defined) keywords. Open-vocabula...
International audienceThis paper describes and evaluates a computational architecture to discover an...
We present a novel approach to speech processing based on the principle of pattern discovery. Our wo...
This work investigates into methods for words, word phrases and longer segments detection in large s...
International audienceIn real audio data, frequently occurring patterns often convey relevant inform...
The bag-of-audio-words approach has been widely used for audio event recognition. In these models, a...
This thesis introduces a novel algorithm for phrase spotting and shows the advantages of using large...
In this article we propose two algorithms for discourse prosodic feature interpretation. The first a...
Modern devices are more frequently equipped with voice control. Using speech to operate is a natural...
This thesis presents work in research topics of audio detection. It first describes a system for lar...
The goal of this work is to automatically determine whether and when a word of interest is spoken by...
In this paper, we propose an A*-admissible key-phrase spotting framework, which needs little domain ...
Millions of user generated video blogs expressing opinions and feelings about products, events, news...
This thesis explores unsupervised algorithms for pattern discovery and retrieval in audio and speech...
This paper assesses the role of robust acoustic features in spoken term detection (a.k.a keyword spo...
Keyword Spotting (KWS) systems allow detecting a set of spoken (pre-defined) keywords. Open-vocabula...
International audienceThis paper describes and evaluates a computational architecture to discover an...
We present a novel approach to speech processing based on the principle of pattern discovery. Our wo...