Including information distributed over intervals of syllabic duration (100--250 ms) may greatly improve the performance of automatic speech recognition (ASR) systems. ASR systems primarily use representations and recognition units covering phonetic durations (40--100 ms). Humans certainly use information at phonetic time scales, but results from psychoacoustics and psycholinguistics highlight the crucial role of the syllable, and syllable-length intervals, in speech perception. We compare the performance of three ASR systems: a baseline system that uses phone-scale representations and units, an experimental system that uses a syllableoriented front-end representation and syllabic units for recognition, and a third system that combines the p...
In this paper, we propose and investigate a new approach towards using multiple time scale informati...
This paper explores the use of the phone and syllable as primary units of representation in the rst ...
An exploratory implementation of a syllable—based recognizer is described. Continuous speech is firs...
The accuracy of speech recognition systems is known to be affected by fast speech. If fast speech ca...
Humans are able to recognise a word before its acoustic realisation is complete. This in contrast to...
It is well known that a higher-than-normal speech rate will cause the rate of recognition errors in ...
The performance of automatic speech recognition systems is usually assessed in terms of error rate. ...
Humans are able to recognise a word before its acoustic realisation is complete. This in contrast to...
Previous research comparing detection times for syllables and for phonemes has consistently found th...
Transforming an acoustic signal to words is the gold standard in automatic speech recognition. Whil...
Thesis (Ph.D.)--Harvard--Massachusetts Institute of Technology Division of Health Sciences and Techn...
In this paper, we present an automatic speech recognition (ASR) system based on the combination of a...
Word recognition techniques are reviewed. An exhaustive comparative study of many of the factors tha...
Auditory word recognition proceeds fluidly despite numerous perturbations and obstacles that exist i...
Speech recognition is the process of converting speech signals to the text. Studies on speech recogn...
In this paper, we propose and investigate a new approach towards using multiple time scale informati...
This paper explores the use of the phone and syllable as primary units of representation in the rst ...
An exploratory implementation of a syllable—based recognizer is described. Continuous speech is firs...
The accuracy of speech recognition systems is known to be affected by fast speech. If fast speech ca...
Humans are able to recognise a word before its acoustic realisation is complete. This in contrast to...
It is well known that a higher-than-normal speech rate will cause the rate of recognition errors in ...
The performance of automatic speech recognition systems is usually assessed in terms of error rate. ...
Humans are able to recognise a word before its acoustic realisation is complete. This in contrast to...
Previous research comparing detection times for syllables and for phonemes has consistently found th...
Transforming an acoustic signal to words is the gold standard in automatic speech recognition. Whil...
Thesis (Ph.D.)--Harvard--Massachusetts Institute of Technology Division of Health Sciences and Techn...
In this paper, we present an automatic speech recognition (ASR) system based on the combination of a...
Word recognition techniques are reviewed. An exhaustive comparative study of many of the factors tha...
Auditory word recognition proceeds fluidly despite numerous perturbations and obstacles that exist i...
Speech recognition is the process of converting speech signals to the text. Studies on speech recogn...
In this paper, we propose and investigate a new approach towards using multiple time scale informati...
This paper explores the use of the phone and syllable as primary units of representation in the rst ...
An exploratory implementation of a syllable—based recognizer is described. Continuous speech is firs...