Abstract — Visual speech information from the speaker’s mouth region has been shown to improve the noise robustness of automatic speech recognizers, thus promising to extend their usability in the human-computer interface. In this paper, we review the main components of audio-visual automatic speech recognition and present novel contributions in two main areas: first, the visual front-end design, based on a cascade of linear image transforms of an appropriate video region of interest, and subsequently, audio-visual speech integration. On the latter topic, we discuss new work on feature and decision fusion combination, the modeling of audio-visual speech asynchrony, and incorporating modality reliability estimates into the bimodal re...
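The reliability-weighted decision fusion referred to in this abstract is most often formalized as a multi-stream model in which audio and visual class log-likelihoods are combined with stream exponents. The sketch below is only one standard instantiation of that idea, not the paper's exact formulation; the symbols $o_A$, $o_V$, $c$, and $\lambda$ are illustrative.

% Decision fusion of audio and visual streams with a reliability weight.
% o_A, o_V: audio and visual observations; c: speech class (e.g., HMM state);
% lambda: audio stream weight, 0 <= lambda <= 1 (illustrative notation).
\[
  \log p(o_A, o_V \mid c) \;=\; \lambda \,\log p_A(o_A \mid c) \;+\; (1-\lambda)\,\log p_V(o_V \mid c), \qquad 0 \le \lambda \le 1 .
\]

Here $p_A$ and $p_V$ denote the audio-only and visual-only class-conditional likelihoods, and the audio weight $\lambda$ would typically be tied to a reliability estimate such as the acoustic signal-to-noise ratio, so that the visual stream dominates as the audio degrades.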
There has been growing interest in introducing speech as a new modality into the human-computer inte...
This paper describes a complete system for audio-visual recognition of continuous speech including r...
The objective of this work is visual recognition of speech and gestures. Solving this problem opens ...
Speechreading increases intelligibility in human speech perception. This suggests that conventional ...
Extending automatic speech recognition (ASR) to the visual modality has been shown to greatly increa...
Extending automatic speech recognition (ASR) to the visual modality has been shown to greatly incre...
In this thesis, a number of important issues relating to the use of both audio and video information...
Abstract—This paper presents the design and evaluation of a speaker-independent audio-visual speech ...
Abstract. In this paper an evaluation of visual speech features is performed specifically for the ta...
We address the problem of robust lip tracking, visual speech feature extraction, and sensor integrat...
We compare automatic recognition with human perception of audio-visual speech, in the large-vocabula...
The increase in the number of multimedia applications that require robust speech recognition systems...
Bimodal automatic speech segmentation using visual information together with audio data is introduce...
Humans are often able to compensate for noise degradation and uncertainty in speech information by a...
Humans are often able to compensate for noise degradation and uncertainty in speech information by a...