The Multi-State Time Delay Neural Network (MS-TDNN) inte-grates a nonlinear time alignment procedure (DTW) and the high-accuracy phoneme spotting capabilities of a TDNN into a connec-tionist speech recognition system with word-level classification and error backpropagation. We present an MS-TDNN for recognizing continuously spelled letters, a task characterized by a small but highly confusable vocabulary. Our MS-TDNN achieves 98.5/92.0% word accuracy on speaker dependent/independent tasks, outper-forming previously reported results on the same databases. We pro-pose training techniques aimed at improving sentence level perfor-mance, including free alignment across word boundaries, word du-ration modeling and error backpropagation on the sen...
neme recognition which is characterized by two important properties: 1.) Using a 3 layer arrangement...
This paper presents a recognition system for isolated words like robot commands. It’s carried out by...
Neural networks have recently been applied to real-world speech recognition problems with a great de...
We present a number of Time-Delay Neural Network (TDNN) based architectures for multi-speaker phonem...
bregler @ irauka.de, manke @ irauka.de In this paper we show how recognition performance in automate...
In this paper we show how recognition performance in automated speech perception can be significantl...
w ai bel CO) cs. cm u. ed u In this paper we present NPen ++, a connectionist system for writer inde...
In this paper we address the problem of text-to-phoneme (TTP) mapping implemented by neural networks...
Time delay neural networks (TDNNs) are an effective acoustic model for large vocabulary speech recog...
An alternative view of neural network based phoneme recognition based on multiresolution signal proc...
This paper presents the neural network (NN) speech recognition using processed LPC input features. B...
Abstract: Phoneme classification and recognition is the first step to large vocabulary continuous sp...
This paper presents a writer independent system for large vocabulary recognition of on-line handwrit...
This paper describes an improved input coding method for a text-to-phoneme (TTP) neural network mode...
An important yet challenging task for neural network based speech recognizers is the effective proce...
neme recognition which is characterized by two important properties: 1.) Using a 3 layer arrangement...
This paper presents a recognition system for isolated words like robot commands. It’s carried out by...
Neural networks have recently been applied to real-world speech recognition problems with a great de...
We present a number of Time-Delay Neural Network (TDNN) based architectures for multi-speaker phonem...
bregler @ irauka.de, manke @ irauka.de In this paper we show how recognition performance in automate...
In this paper we show how recognition performance in automated speech perception can be significantl...
w ai bel CO) cs. cm u. ed u In this paper we present NPen ++, a connectionist system for writer inde...
In this paper we address the problem of text-to-phoneme (TTP) mapping implemented by neural networks...
Time delay neural networks (TDNNs) are an effective acoustic model for large vocabulary speech recog...
An alternative view of neural network based phoneme recognition based on multiresolution signal proc...
This paper presents the neural network (NN) speech recognition using processed LPC input features. B...
Abstract: Phoneme classification and recognition is the first step to large vocabulary continuous sp...
This paper presents a writer independent system for large vocabulary recognition of on-line handwrit...
This paper describes an improved input coding method for a text-to-phoneme (TTP) neural network mode...
An important yet challenging task for neural network based speech recognizers is the effective proce...
neme recognition which is characterized by two important properties: 1.) Using a 3 layer arrangement...
This paper presents a recognition system for isolated words like robot commands. It’s carried out by...
Neural networks have recently been applied to real-world speech recognition problems with a great de...