This paper explores the use of the phone and syllable as primary units of representation in the rst stage of a two-stage recognizer. A nite-state transducer speech rec-ognizer is utilized to congure the recognition as a two-stage process, where either phone or syllable graphs are computed in the rst stage, and passed to the second stage to determine the most likely word hypotheses. Preliminary experiments in a weather information speech understanding domain show that a syllable representation with either bi-gram or trigram language models provides more constraint than a phonetic representation with a higher-order n-gram language model (up to a 6-gram), and approaches the per-formance of a more conventional single-stage word-based con gurati...
multiple streams of semi-independent phonological features, rather than phones • From speech recogni...
This paper presents a new approach to phoneme recognition using nonsequential sub--phoneme units. Th...
Techniques for automatic phoneme recognition from spoken speech are investigated. The goal is to ext...
Speech recognition is the process of converting speech signals to the text. Studies on speech recogn...
Including information distributed over intervals of syllabic duration (100--250 ms) may greatly impr...
We describe a speech recogniser which uses a speech production-motivated phonetic-feature descriptio...
A realistic model of speech recognition and understanding should be heavily based both on linguistic...
The design of speech recognition system based on acous-tically-derived, segmental units can be divid...
We describe recent work on two new automatic speech recognition systems. The first part of this pape...
This paper describes a new method of word model gener-ation based on acoustically derived segment un...
Contains fulltext : 75081.pdf (publisher's version ) (Open Access)This letter eval...
Introduction At the current state of the art, high-accuracy speech recognition with moderate to lar...
Machine recognition of spoken language requires developing more robust recognition algorithms. The...
This paper describes our experiences with developing a realtime telephone-based speech recognizer as...
The MIT SUMMIT speech recognition system models pronunciation using a phonemic baseform dictionary a...
multiple streams of semi-independent phonological features, rather than phones • From speech recogni...
This paper presents a new approach to phoneme recognition using nonsequential sub--phoneme units. Th...
Techniques for automatic phoneme recognition from spoken speech are investigated. The goal is to ext...
Speech recognition is the process of converting speech signals to the text. Studies on speech recogn...
Including information distributed over intervals of syllabic duration (100--250 ms) may greatly impr...
We describe a speech recogniser which uses a speech production-motivated phonetic-feature descriptio...
A realistic model of speech recognition and understanding should be heavily based both on linguistic...
The design of speech recognition system based on acous-tically-derived, segmental units can be divid...
We describe recent work on two new automatic speech recognition systems. The first part of this pape...
This paper describes a new method of word model gener-ation based on acoustically derived segment un...
Contains fulltext : 75081.pdf (publisher's version ) (Open Access)This letter eval...
Introduction At the current state of the art, high-accuracy speech recognition with moderate to lar...
Machine recognition of spoken language requires developing more robust recognition algorithms. The...
This paper describes our experiences with developing a realtime telephone-based speech recognizer as...
The MIT SUMMIT speech recognition system models pronunciation using a phonemic baseform dictionary a...
multiple streams of semi-independent phonological features, rather than phones • From speech recogni...
This paper presents a new approach to phoneme recognition using nonsequential sub--phoneme units. Th...
Techniques for automatic phoneme recognition from spoken speech are investigated. The goal is to ext...