The design of speech recognition system based on acous-tically-derived, segmental units can be divided in three steps: unit design, lexicon building and pronunciation mod-eling. We formulate an iterative unit design procedure which consistently uses a maximum likelihood (ML) objec-tive in successive application of resegmentation and model re-estimation. The lexicon building allows multi-word en-tries in the lexicon but restricts the number of these entries in order to avoid a too costly search. Selected multi-word lexical entries are those with high frequency (such as func-tion words) and those which consistently exhibit cross-word phone assimilation. The stochastic pronunciation model represents the likelihood of a particular acoustic segm...
Here is presented a phonetic source model whose parameters, estimated from phonetically transcribed ...
In the last few years the field of computer speech recognition has come into its own as a practical ...
This work is intended to explore the performance of a new set of acoustic model units in speech reco...
This paper describes a new method of word model gener-ation based on acoustically derived segment un...
In this paper, we describe important improvements that were recently introduced in our Discriminativ...
The performance of speech recognition systems degrades when the basic sound units used are poorly de...
Statistical data-driven methods and knowledge-based methods are two recent trends in Automatic Speec...
An automatic segmentation method is tested here, which uses a combination of entropy coding, continu...
This paper explores the use of the phone and syllable as primary units of representation in the rst ...
Segment vocoders play a special role in very low bitrate speech coding to achieve intelligible speec...
In this paper, we present some recent improvements in our automatic speech segmentation system, whic...
For segmenting a speech database, using a family of acoustic models provides multiple estimates of e...
Phonetic segmentation is the breakup and classication of the sound signal into a string of phones. T...
Recent work in phonetic speaker recognition has shown that modeling phone sequences using n-grams is...
The purpose of this research is to determine the best method for deciding on an optimal set of conca...
Here is presented a phonetic source model whose parameters, estimated from phonetically transcribed ...
In the last few years the field of computer speech recognition has come into its own as a practical ...
This work is intended to explore the performance of a new set of acoustic model units in speech reco...
This paper describes a new method of word model gener-ation based on acoustically derived segment un...
In this paper, we describe important improvements that were recently introduced in our Discriminativ...
The performance of speech recognition systems degrades when the basic sound units used are poorly de...
Statistical data-driven methods and knowledge-based methods are two recent trends in Automatic Speec...
An automatic segmentation method is tested here, which uses a combination of entropy coding, continu...
This paper explores the use of the phone and syllable as primary units of representation in the rst ...
Segment vocoders play a special role in very low bitrate speech coding to achieve intelligible speec...
In this paper, we present some recent improvements in our automatic speech segmentation system, whic...
For segmenting a speech database, using a family of acoustic models provides multiple estimates of e...
Phonetic segmentation is the breakup and classication of the sound signal into a string of phones. T...
Recent work in phonetic speaker recognition has shown that modeling phone sequences using n-grams is...
The purpose of this research is to determine the best method for deciding on an optimal set of conca...
Here is presented a phonetic source model whose parameters, estimated from phonetically transcribed ...
In the last few years the field of computer speech recognition has come into its own as a practical ...
This work is intended to explore the performance of a new set of acoustic model units in speech reco...