In this work a new Bangla speech corpus along with proper transcriptions has been developed; also various acoustic feature extraction methods have been investigated using Long Short-Term Memory (LSTM) neural network to find their effective integration into a state-of-the-art Bangla speech recognition system. The acoustic features are usually a sequence of representative vectors that are extracted from speech signals and the classes are either words or sub word units such as phonemes. The most commonly used feature extraction method, known as linear predictive coding (LPC), has been used first in this work. Then the other two popular methods, namely, the Mel frequency cepstral coefficients (MFCC) and perceptual linear prediction (PLP) have a...
The automatic identification of language from voice clips is known as automatic language identificat...
Speech has much capability as an interface between human and computer which comes under the Human Co...
Speech recognition can be defined as the process of converting voice signals into the ranks of the w...
ABSTRACT In this work a new Bangla speech corpus along with proper transcriptions has been develope...
ABSTRACT The performance of various acoustic feature extraction methods has been compared in this w...
The performance of various acoustic feature extraction methods has been compared in this work using ...
In this paper, we compare among performance of different acoustic features for Bangla Automatic Spee...
This paper presents a Bangla phoneme recognition method for Automatic Speech Recognition (ASR). The ...
Speech is a natural way of communication and it provides an intuitive user interface to machines. Al...
Speech recognition can be defined as the process of converting voice signals into the ranks of the w...
This paper is concerned with the development of Back-propagation Neural Network for Bangla Speech Re...
In this paper, we introduce a system for Bangla digit automatic speech recognition (ASR). Though Ban...
To carry out any kind of research in the field of speech signal processing, a standard database is e...
Speech recognition has been an active research topic for more than 50 years. Interacting with the co...
To provide new technological benefits to the mass people, nowadays, regional and local language reco...
The automatic identification of language from voice clips is known as automatic language identificat...
Speech has much capability as an interface between human and computer which comes under the Human Co...
Speech recognition can be defined as the process of converting voice signals into the ranks of the w...
ABSTRACT In this work a new Bangla speech corpus along with proper transcriptions has been develope...
ABSTRACT The performance of various acoustic feature extraction methods has been compared in this w...
The performance of various acoustic feature extraction methods has been compared in this work using ...
In this paper, we compare among performance of different acoustic features for Bangla Automatic Spee...
This paper presents a Bangla phoneme recognition method for Automatic Speech Recognition (ASR). The ...
Speech is a natural way of communication and it provides an intuitive user interface to machines. Al...
Speech recognition can be defined as the process of converting voice signals into the ranks of the w...
This paper is concerned with the development of Back-propagation Neural Network for Bangla Speech Re...
In this paper, we introduce a system for Bangla digit automatic speech recognition (ASR). Though Ban...
To carry out any kind of research in the field of speech signal processing, a standard database is e...
Speech recognition has been an active research topic for more than 50 years. Interacting with the co...
To provide new technological benefits to the mass people, nowadays, regional and local language reco...
The automatic identification of language from voice clips is known as automatic language identificat...
Speech has much capability as an interface between human and computer which comes under the Human Co...
Speech recognition can be defined as the process of converting voice signals into the ranks of the w...