This work investigates into methods for words, word phrases and longer segments detection in large speech data sets in an unsupervised way. At first, basics for the given topic and principles of modern methods for searching of repeating objects are introduced. The representation and segmentation of the input data are described. Techniques for object detection in speech are presented. The description of found motifs modelling follows. The next step defi nes data sets for experiments in which spoken term detection by an example is performed. The system requirements are described. In the conclusion, the work is summarised and suggestions for further development are discussed
Field speech data pose great challenges to statistical modeling because the speech signal is often i...
International audienceProviding effective tools to navigate and access through long audio archives, ...
The technology for unlimited vocabulary automatic keyword spotting in spontaneous Russian speech is ...
This thesis explores unsupervised algorithms for pattern discovery and retrieval in audio and speech...
We present a novel approach to speech processing based on the principle of pattern discovery. Our wo...
In this paper, we present an unsupervised method for automatically discovering words from speech usi...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
International audienceThis paper describes and evaluates a computational architecture to discover an...
Abstract—This article deals with universal sequential audio pattern search and sound recognition met...
International audienceIn real audio data, frequently occurring patterns often convey relevant inform...
We propose a new type of audio feature (HFCC-ENS) as well as an unsupervised method for detecting sh...
This thesis introduces a novel algorithm for phrase spotting and shows the advantages of using large...
This thesis aims a speaker independent spoken term detection (STD) using supervised technique.\ud Th...
This contribution presents the advances of the Fraunhofer IAIS Audiomining system for vocabulary ind...
National audienceWe propose a method to automatically discover repeating acoustic patterns in speech...
Field speech data pose great challenges to statistical modeling because the speech signal is often i...
International audienceProviding effective tools to navigate and access through long audio archives, ...
The technology for unlimited vocabulary automatic keyword spotting in spontaneous Russian speech is ...
This thesis explores unsupervised algorithms for pattern discovery and retrieval in audio and speech...
We present a novel approach to speech processing based on the principle of pattern discovery. Our wo...
In this paper, we present an unsupervised method for automatically discovering words from speech usi...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
International audienceThis paper describes and evaluates a computational architecture to discover an...
Abstract—This article deals with universal sequential audio pattern search and sound recognition met...
International audienceIn real audio data, frequently occurring patterns often convey relevant inform...
We propose a new type of audio feature (HFCC-ENS) as well as an unsupervised method for detecting sh...
This thesis introduces a novel algorithm for phrase spotting and shows the advantages of using large...
This thesis aims a speaker independent spoken term detection (STD) using supervised technique.\ud Th...
This contribution presents the advances of the Fraunhofer IAIS Audiomining system for vocabulary ind...
National audienceWe propose a method to automatically discover repeating acoustic patterns in speech...
Field speech data pose great challenges to statistical modeling because the speech signal is often i...
International audienceProviding effective tools to navigate and access through long audio archives, ...
The technology for unlimited vocabulary automatic keyword spotting in spontaneous Russian speech is ...