This paper presents a Bangla phoneme recognition method for Automatic Speech Recognition (ASR). The method consists of three stages: i) a multilayer neural network (MLN), which converts acoustic features, mel frequency cepstral coefficients (MFCCs), into phoneme probabilities, ii) the phoneme probabilities obtained from the first stage and corresponding Δ and ΔΔ are inserted into another MLN to improve the phoneme probabilities by reducing the context effect and (iii) the phoneme probabilities of current frame and corresponding MFCCs are fed into a hidden Markov model (HMM) based classifier to obtain more accurate phoneme strings. From the experiments on Bangla speech corpus prepared by us, it is observed that the proposed method provides h...
To carry out any kind of research in the field of speech signal processing, a standard database is e...
Speech Recognition is the process of converting an acoustic wave-form into text containing the simil...
Research work on the design of robust multimodal speech recognition systems making use of acoustic, ...
In this paper, we compare among performance of different acoustic features for Bangla Automatic Spee...
In this paper, we introduce a system for Bangla digit automatic speech recognition (ASR). Though Ban...
ABSTRACT In this work a new Bangla speech corpus along with proper transcriptions has been develope...
Phoneme recognition is important for successful development of speech recognizers in most real world...
Abstract: The baseline system of an automatic speech recognition normally uses Mel-Frequency Cepstra...
This paper is concerned with the development of Back-propagation Neural Network for Bangla Speech Re...
“Speech Recognition” of audio signal is important for telecommunication, language identification and...
In this work a new Bangla speech corpus along with proper transcriptions has been developed; also va...
The automatic identification of language from voice clips is known as automatic language identificat...
Abstract: This paper addresses the problem of speech recognition to identify various modes of speech...
This thesis report is submitted in partial fulfilment of the requirements for the degree of Bachelor...
Speech recognition has been an active research topic for more than 50 years. Interacting with the co...
To carry out any kind of research in the field of speech signal processing, a standard database is e...
Speech Recognition is the process of converting an acoustic wave-form into text containing the simil...
Research work on the design of robust multimodal speech recognition systems making use of acoustic, ...
In this paper, we compare among performance of different acoustic features for Bangla Automatic Spee...
In this paper, we introduce a system for Bangla digit automatic speech recognition (ASR). Though Ban...
ABSTRACT In this work a new Bangla speech corpus along with proper transcriptions has been develope...
Phoneme recognition is important for successful development of speech recognizers in most real world...
Abstract: The baseline system of an automatic speech recognition normally uses Mel-Frequency Cepstra...
This paper is concerned with the development of Back-propagation Neural Network for Bangla Speech Re...
“Speech Recognition” of audio signal is important for telecommunication, language identification and...
In this work a new Bangla speech corpus along with proper transcriptions has been developed; also va...
The automatic identification of language from voice clips is known as automatic language identificat...
Abstract: This paper addresses the problem of speech recognition to identify various modes of speech...
This thesis report is submitted in partial fulfilment of the requirements for the degree of Bachelor...
Speech recognition has been an active research topic for more than 50 years. Interacting with the co...
To carry out any kind of research in the field of speech signal processing, a standard database is e...
Speech Recognition is the process of converting an acoustic wave-form into text containing the simil...
Research work on the design of robust multimodal speech recognition systems making use of acoustic, ...