When developing a speech recognition system, one must start by deciding what the units to be recognized should be. This is for the most part a straightforward choice in the case of word-based languages uch as English, but becomes an issue even in handling languages with a complex compounding system like German; with an agglutinative language like Japanese, which provides no spaces in written text, the choice is not at all obvious. Once an appropri-ate unit has been determined, the problem of consistently segment-ing transcriptions of training data must be addressed. This paper describes a method for learning a lexicon from a training corpus which contains no word-level segmentation, applied to the prob-lem of building a Japanese speech reco...
Models of the acquisition of word segmen-tation are typically evaluated using phonem-ically transcri...
This paper considers discriminative training of language models for large vocabulary continuous spee...
International audience— In this paper we present an integrated unsupervised method to produce a qual...
When developing a speech recognition system, one must start by deciding what the units to be recogni...
We describe an automatic process for learning word units in Japanese. Since the Japanese orthography...
We attemped to improve recognition accuracy by reduc-ing the inadequacies of the lexicon and languag...
If the objective of a Continuous Automatic Speech Understanding system is not a speech-to-text trans...
Natural language processing systems such as speech recognition and ma-chine translation conventional...
Design issues of a spontaneous speech corpus is described. The corpus under compilation will contain...
In the speech recognition of highly inflecting or compounding languages, the traditional word-based ...
We describe the protocol used for collecting a corpus of conversational English speech from non-nati...
One of the thorniest problems of large vocabulary continuous speech recognition systems is the large...
A significant cost in obtaining acoustic training data is the generation of accurate transcriptions....
To investigate problems of spontaneous speech recognition using N-grams and HMMs and estimate the ro...
This paper presents a method for reducing the effort of transcribing user utterances to develop lang...
Models of the acquisition of word segmen-tation are typically evaluated using phonem-ically transcri...
This paper considers discriminative training of language models for large vocabulary continuous spee...
International audience— In this paper we present an integrated unsupervised method to produce a qual...
When developing a speech recognition system, one must start by deciding what the units to be recogni...
We describe an automatic process for learning word units in Japanese. Since the Japanese orthography...
We attemped to improve recognition accuracy by reduc-ing the inadequacies of the lexicon and languag...
If the objective of a Continuous Automatic Speech Understanding system is not a speech-to-text trans...
Natural language processing systems such as speech recognition and ma-chine translation conventional...
Design issues of a spontaneous speech corpus is described. The corpus under compilation will contain...
In the speech recognition of highly inflecting or compounding languages, the traditional word-based ...
We describe the protocol used for collecting a corpus of conversational English speech from non-nati...
One of the thorniest problems of large vocabulary continuous speech recognition systems is the large...
A significant cost in obtaining acoustic training data is the generation of accurate transcriptions....
To investigate problems of spontaneous speech recognition using N-grams and HMMs and estimate the ro...
This paper presents a method for reducing the effort of transcribing user utterances to develop lang...
Models of the acquisition of word segmen-tation are typically evaluated using phonem-ically transcri...
This paper considers discriminative training of language models for large vocabulary continuous spee...
International audience— In this paper we present an integrated unsupervised method to produce a qual...