This paper presents a strategy for efficiently selecting informative data from large corpora of transcribed speech. We propose to choose data uniformly according to the distribution of some target speech unit (phoneme, word, character, etc). In our experiment, in contrast to the common belief that “there is no data like more data”, we found it possible to select a highly informative subset of data that produces recognition performance comparable to a system that makes use of a much larger amount of data. At the same time, our selection process is efficient and fast. Index Terms — data selection, maximum entropy, speech recog-nition, acoustic modeling 1
We propose a simple yet effective method for improving speech recognition by reranking the N-best sp...
To obtain a robust acoustic model for a certain speech recognition task, a large amount of speech da...
In this thesis, research on large vocabulary continuous speech recognition for unknown audio conditi...
This paper presents a strategy for efficiently selecting informative data from large corpora of trans...
Automatic speech recognition (ASR) technology has matured over the past few decades and has made sig...
Contains fulltext : 75067.pdf (author's version ) (Open Access)ICSLP 2002, 16 sept...
In this work, I investigated structured approaches to data selection for speaker recognition, with a...
This paper presents an extended study in the topic of optimal selection of speech data from a databa...
This paper presents a data selection approach where spoken ut-terances are selected in a sequential ...
The task of a speech recogniser is to transcribe human speech into text. To do so, modern recogniser...
There is growing recognition of the importance of data-centric methods for building machine learning...
The demand of intelligent machines that may recognize the spoken speech and respond in a natural vo...
This thesis describes a designed and implemented system for efficient storage, indexing and search i...
© 2016 IEEE. Exemplar-based acoustic modeling is based on labeled training segments that are compare...
Colloque avec actes et comité de lecture. internationale.International audienceThis paper focuses on...
We propose a simple yet effective method for improving speech recognition by reranking the N-best sp...
To obtain a robust acoustic model for a certain speech recognition task, a large amount of speech da...
In this thesis, research on large vocabulary continuous speech recognition for unknown audio conditi...
This paper presents a strategy for efficiently selecting informative data from large corpora of trans...
Automatic speech recognition (ASR) technology has matured over the past few decades and has made sig...
Contains fulltext : 75067.pdf (author's version ) (Open Access)ICSLP 2002, 16 sept...
In this work, I investigated structured approaches to data selection for speaker recognition, with a...
This paper presents an extended study in the topic of optimal selection of speech data from a databa...
This paper presents a data selection approach where spoken ut-terances are selected in a sequential ...
The task of a speech recogniser is to transcribe human speech into text. To do so, modern recogniser...
There is growing recognition of the importance of data-centric methods for building machine learning...
The demand of intelligent machines that may recognize the spoken speech and respond in a natural vo...
This thesis describes a designed and implemented system for efficient storage, indexing and search i...
© 2016 IEEE. Exemplar-based acoustic modeling is based on labeled training segments that are compare...
Colloque avec actes et comité de lecture. internationale.International audienceThis paper focuses on...
We propose a simple yet effective method for improving speech recognition by reranking the N-best sp...
To obtain a robust acoustic model for a certain speech recognition task, a large amount of speech da...
In this thesis, research on large vocabulary continuous speech recognition for unknown audio conditi...