This paper presents pre-processing of input features to artificial neural network (NN). This is for preparation of reliable reference templates for the set of words to be recognized. The processed features are pitch and Linear Predictive Coefficients (LPC) for input and reference templates, based on Dynamic Time Warping (DTW) algorithm. The first task is to extract pitch features using Pitch Scale Harmonic Filter (PSHF) algorithm [12]. Another task is to align the input frames (test set) to the reference template (training set) using DTW fixing frame (DTW-FF) algorithm. This proper time normalization is needed since NN is designed to compare data of the same length whilst same speech can varies in their length. By doing frame fixing (time n...
In most automatic speech recognition (ASR) systems, speaker differences are compensated by normalizi...
Reducing the transmission bandwidth and achieving higher speech quality are primary concerns in deve...
Augmenting datasets by transforming inputs in a way that does not change the label is a crucial ingr...
This paper presents pre-processing of input features to artificial neural network (NN). This is fo...
This paper presents pre-processing of input features to artificial neural network (NN). This is fo...
Abstract. A pre-processing of linear predictive coefficient (LPC) features for preparation of reliab...
This paper presents a method to extract existing speech features in dynamic time warping path which ...
This paper presents a method to extract existing speech features in dynamic time warping path which ...
This paper presents the neural network (NN) speech recognition using processed LPC input features. B...
This paper presents the pre-processing of speech templates for artificial neural network (ANN). The ...
This paper presents the neural network (NN) speech recognition using processed LPC input features. B...
This paper presents a procedure of frame normalization based on the traditional dynamic time warping...
Automatic Speech Recognition products are already available in the market since many years ago. Inte...
In this work, the speaker normalisation problem is afforded by two different techniques. The first o...
In most automatic speech recognition (ASR) systems, speaker differences are compensated by normalizi...
In most automatic speech recognition (ASR) systems, speaker differences are compensated by normalizi...
Reducing the transmission bandwidth and achieving higher speech quality are primary concerns in deve...
Augmenting datasets by transforming inputs in a way that does not change the label is a crucial ingr...
This paper presents pre-processing of input features to artificial neural network (NN). This is fo...
This paper presents pre-processing of input features to artificial neural network (NN). This is fo...
Abstract. A pre-processing of linear predictive coefficient (LPC) features for preparation of reliab...
This paper presents a method to extract existing speech features in dynamic time warping path which ...
This paper presents a method to extract existing speech features in dynamic time warping path which ...
This paper presents the neural network (NN) speech recognition using processed LPC input features. B...
This paper presents the pre-processing of speech templates for artificial neural network (ANN). The ...
This paper presents the neural network (NN) speech recognition using processed LPC input features. B...
This paper presents a procedure of frame normalization based on the traditional dynamic time warping...
Automatic Speech Recognition products are already available in the market since many years ago. Inte...
In this work, the speaker normalisation problem is afforded by two different techniques. The first o...
In most automatic speech recognition (ASR) systems, speaker differences are compensated by normalizi...
In most automatic speech recognition (ASR) systems, speaker differences are compensated by normalizi...
Reducing the transmission bandwidth and achieving higher speech quality are primary concerns in deve...
Augmenting datasets by transforming inputs in a way that does not change the label is a crucial ingr...