International audienceStandard Visual Speech Recognition (VSR) systems directly process images as input features without any apriori link between raw pixel data and facial traits. Pixel information is smartly sieved when facial landmarks are extracted from pictures and repurposed as graph nodes. Their evolution through time is thus modeled by a Graph Convolutional Network. However, with graph-based VSR being in its infancy, the selection of points and their correlation are still ill-defined and often bound to aprioristic knowledge and handcrafted techniques. In this paper, we investigate the graph approach for VSR and its ability to learn the correlation between points beyond the mouth region. We also study the different contributions that ...
In this paper we propose a new learning-based representation that is referred to as Visual Speech Un...
Visual Speech Recognition (VSR) is a process of understanding speech by interpreting visual informat...
This thesis describes how an automatic lip reader was realized. Visual speech recognition is a preco...
Obtaining a robust feature representation of visual speech is\ud of crucial importance in the design...
In visual speech recognition (VSR), speech is transcribed using only visual information to interpret...
Abstract. In this paper an evaluation of visual speech features is performed specifically for the ta...
We study the problem of automatic visual speech recognition (VSR) using dynamic Bayesian network (DB...
Visual speech recognition (VSR) aims to recognize the content of speech based on lip movements, with...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
Visual speech, referring to the visual domain of speech, has attracted increasing attention due to i...
The objective of this work is visual recognition of speech and gestures. Solving this problem opens ...
Visual information from a speaker's mouth region is known to improve automatic speech recognition ro...
Visual Speech Recognition (VSR) related studies largely ignore the use of state of the art approache...
Automatic speech recognition (ASR) performs well under restricted conditions, but performance degrad...
In recent research efforts, the integration of visual cues into speech analysis systems has been pro...
In this paper we propose a new learning-based representation that is referred to as Visual Speech Un...
Visual Speech Recognition (VSR) is a process of understanding speech by interpreting visual informat...
This thesis describes how an automatic lip reader was realized. Visual speech recognition is a preco...
Obtaining a robust feature representation of visual speech is\ud of crucial importance in the design...
In visual speech recognition (VSR), speech is transcribed using only visual information to interpret...
Abstract. In this paper an evaluation of visual speech features is performed specifically for the ta...
We study the problem of automatic visual speech recognition (VSR) using dynamic Bayesian network (DB...
Visual speech recognition (VSR) aims to recognize the content of speech based on lip movements, with...
One of the most challenging tasks in automatic visual speech recognition is the extraction of featur...
Visual speech, referring to the visual domain of speech, has attracted increasing attention due to i...
The objective of this work is visual recognition of speech and gestures. Solving this problem opens ...
Visual information from a speaker's mouth region is known to improve automatic speech recognition ro...
Visual Speech Recognition (VSR) related studies largely ignore the use of state of the art approache...
Automatic speech recognition (ASR) performs well under restricted conditions, but performance degrad...
In recent research efforts, the integration of visual cues into speech analysis systems has been pro...
In this paper we propose a new learning-based representation that is referred to as Visual Speech Un...
Visual Speech Recognition (VSR) is a process of understanding speech by interpreting visual informat...
This thesis describes how an automatic lip reader was realized. Visual speech recognition is a preco...