The frame-synchronized framework has dominated many speech processing systems, such as ASR and AED targeting human speech activities. These systems have little consideration for the science behind speech and treat the task as a simple statistical classification. The framework also assumes each feature vector to be equally important to the task. However, through some preliminary experiments, this study has found evidence that some concepts defined in speech perception theories such as auditory roughness and acoustic landmarks can act as heuristics to these systems and benefit them in multiple ways. Findings of acoustic landmarks hint that the idea of treating each frame equally might not be optimal. In some cases, landmark information can im...
While there have been many attempts to mitigate interferences of background noise, the performance o...
<p>The expertise required to develop a speech recognition system with reasonable accuracy for a give...
The performance of the speech recognition systems to translate voice to text is still an issue in la...
The frame-synchronized framework has dominated many speech processing systems, such as ASR and AED t...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
Four aspects of human speech processing are dis-cussed along with their impact on the fundamen-tal s...
In spite of the effort and progress made during the last few decades, the performance of automatic s...
Contains fulltext : 178421.pdf (publisher's version ) (Open Access)Human speech co...
Kirchhoff K, Fink GA, Sagerer G. Combining acoustic and articulatory feature information for robust ...
This paper examines the usefulness of including prosodic and phonetic context information in the pho...
One view of speech perception is that acoustic signals are transformed into representations for patt...
Speech perception is an extremely difficult perceptual task that people do effortlessly. It requires...
Reverberation in speech degrades the performance of speech recognition systems, leading to higher wo...
Listeners outperform ASR systems in every speech recognition task. However, what is not clear is whe...
This paper examines the usefulness of including prosodic and phonetic context information in the pho...
While there have been many attempts to mitigate interferences of background noise, the performance o...
<p>The expertise required to develop a speech recognition system with reasonable accuracy for a give...
The performance of the speech recognition systems to translate voice to text is still an issue in la...
The frame-synchronized framework has dominated many speech processing systems, such as ASR and AED t...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
Four aspects of human speech processing are dis-cussed along with their impact on the fundamen-tal s...
In spite of the effort and progress made during the last few decades, the performance of automatic s...
Contains fulltext : 178421.pdf (publisher's version ) (Open Access)Human speech co...
Kirchhoff K, Fink GA, Sagerer G. Combining acoustic and articulatory feature information for robust ...
This paper examines the usefulness of including prosodic and phonetic context information in the pho...
One view of speech perception is that acoustic signals are transformed into representations for patt...
Speech perception is an extremely difficult perceptual task that people do effortlessly. It requires...
Reverberation in speech degrades the performance of speech recognition systems, leading to higher wo...
Listeners outperform ASR systems in every speech recognition task. However, what is not clear is whe...
This paper examines the usefulness of including prosodic and phonetic context information in the pho...
While there have been many attempts to mitigate interferences of background noise, the performance o...
<p>The expertise required to develop a speech recognition system with reasonable accuracy for a give...
The performance of the speech recognition systems to translate voice to text is still an issue in la...