In building speech recognition based applications, robustness to different noisy background condition is an important challenge. In this paper bimodal approach is proposed to improve the robustness of Hindi speech recognition system. Also an importance of different types of visual features is studied for audio visual automatic speech recognition (AVASR) system under diverse noisy audio conditions. Four sets of visual feature based on Two-Dimensional Discrete Cosine Transform feature (2D-DCT), Principal Component Analysis (PCA), Two-Dimensional Discrete Wavelet Transform followed by DCT (2D-DWT-DCT) and Two-Dimensional Discrete Wavelet Transform followed by PCA (2D-DWT-PCA) are reported. The audio features are extracted using Mel Frequency C...
Abstract — Visual speech information from the speaker’s mouth region has been successfully shown to ...
Speech is the most natural means of communication among human beings and speech processing and recog...
Abstract: This paper presents a new multi pose audio visual speech recognition system based on fusio...
An Automatic Speech Recognition (ASR) system implementation uses a conventional pattern recognition ...
In this era of smart applications, Automatic Speech Recognition (ASR) has established itself as an e...
383-386In Automatic Speech Recognition (ASR) based system implementation, robustness to several nois...
Research work on the design of robust multimodal speech recognition systems making use of acoustic, ...
In this thesis, a number of important issues relating to the use of both audio and video information...
Speech is a natural way of communication and it provides an intuitive user interface to machines. Al...
Application specific voice interfaces in local languages will go a long way in reaching the benefits...
Background: The aim of the study was to develop a test material in Hindi language for assessing sent...
A Voice Oriented Interactive Computing Environment (VOICE) has been implemented in the Hindi languag...
This paper presents a baseline digits speech recognizer for Hindi language. The recording environmen...
Automatic Speech Recognition (ASR) is a flourishing and swift area for the conversion of acoustic si...
Abstract — Every person in the world want to share their information, thoughts from one person to an...
Abstract — Visual speech information from the speaker’s mouth region has been successfully shown to ...
Speech is the most natural means of communication among human beings and speech processing and recog...
Abstract: This paper presents a new multi pose audio visual speech recognition system based on fusio...
An Automatic Speech Recognition (ASR) system implementation uses a conventional pattern recognition ...
In this era of smart applications, Automatic Speech Recognition (ASR) has established itself as an e...
383-386In Automatic Speech Recognition (ASR) based system implementation, robustness to several nois...
Research work on the design of robust multimodal speech recognition systems making use of acoustic, ...
In this thesis, a number of important issues relating to the use of both audio and video information...
Speech is a natural way of communication and it provides an intuitive user interface to machines. Al...
Application specific voice interfaces in local languages will go a long way in reaching the benefits...
Background: The aim of the study was to develop a test material in Hindi language for assessing sent...
A Voice Oriented Interactive Computing Environment (VOICE) has been implemented in the Hindi languag...
This paper presents a baseline digits speech recognizer for Hindi language. The recording environmen...
Automatic Speech Recognition (ASR) is a flourishing and swift area for the conversion of acoustic si...
Abstract — Every person in the world want to share their information, thoughts from one person to an...
Abstract — Visual speech information from the speaker’s mouth region has been successfully shown to ...
Speech is the most natural means of communication among human beings and speech processing and recog...
Abstract: This paper presents a new multi pose audio visual speech recognition system based on fusio...