Thesis (Ph.D.)--Harvard--Massachusetts Institute of Technology Division of Health Sciences and Technology, 1998.Includes bibliographical references (p. 113-117).The goal of this research is to determine how aspects of human speech processing can be utilized to improve the performance of Automatic Speech Recognition (ASR) systems. Three traditional ASR parameterizations matched with Hidden Markov Models (HMMs) are compared to humans on a consonant recognition task using Consonant Vowel- Consonant (CVC) nonsense syllables degraded by highpass filtering, lowpass filtering, or additive noise. Confusion matrices were determined by recognizing the syllabies using different ASR front ends, including Mel-Filter Bank (MFB) energies, Mel-F filtered C...
sitä The effect of bio-inspired spectro-temporal processing for automatic speech recognition (ASR) ...
Natural language processing enables computer and machines to understand and speak human languages. S...
This thesis addresses the general problem of maintaining robust automatic speech recognition (ASR) p...
Three time-varying functions, which can be extracted. directly from the raw speech waveform, are of ...
Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer ...
State-of-the-art automatic speech recognition (ASR) systems are significantly inferior to humans esp...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
Speech in a noisy background presents a challenge for the recognition of that speech both by human l...
Listeners outperform ASR systems in every speech recognition task. However, what is not clear is whe...
Thesis (Ph. D.)—Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer ...
Communication is the basic need of everyone for a person who want to survive in this world. Research...
English nonsense consonant-vowel-consonant syllables were presented at four different signal-to-nois...
This study is a first step in selecting an appropriate subword unit representation to synthesize hig...
[[abstract]]Normal-hearing subjects' recognition of spectrally degraded speech was evaluated under c...
Statistical data-driven methods and knowledge-based methods are two recent trends in Automatic Speec...
sitä The effect of bio-inspired spectro-temporal processing for automatic speech recognition (ASR) ...
Natural language processing enables computer and machines to understand and speak human languages. S...
This thesis addresses the general problem of maintaining robust automatic speech recognition (ASR) p...
Three time-varying functions, which can be extracted. directly from the raw speech waveform, are of ...
Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer ...
State-of-the-art automatic speech recognition (ASR) systems are significantly inferior to humans esp...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
Speech in a noisy background presents a challenge for the recognition of that speech both by human l...
Listeners outperform ASR systems in every speech recognition task. However, what is not clear is whe...
Thesis (Ph. D.)—Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer ...
Communication is the basic need of everyone for a person who want to survive in this world. Research...
English nonsense consonant-vowel-consonant syllables were presented at four different signal-to-nois...
This study is a first step in selecting an appropriate subword unit representation to synthesize hig...
[[abstract]]Normal-hearing subjects' recognition of spectrally degraded speech was evaluated under c...
Statistical data-driven methods and knowledge-based methods are two recent trends in Automatic Speec...
sitä The effect of bio-inspired spectro-temporal processing for automatic speech recognition (ASR) ...
Natural language processing enables computer and machines to understand and speak human languages. S...
This thesis addresses the general problem of maintaining robust automatic speech recognition (ASR) p...