In this paper, we present a statistical method based on GMM modeling that maps acoustic speech spectral features to the visual features of Cued Speech, in the least-squares-error sense, at a low signal level. This is innovative and differs from the classic text-to-visual approach. For comparison with the GMM-based mapping, we first present results obtained with a multi-linear model, also at the low signal level, and study the limitations of that approach. The experimental results demonstrate that the GMM-based mapping method can significantly improve mapping performance compared with the multi-linear mapping model, especially when the linear correlation between the target and the predictor is weak ...
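To make the contrast between the two kinds of mapping concrete, here is a minimal sketch, not the authors' implementation. It assumes a joint GMM over a scalar predictor x and a scalar target y, and uses the standard minimum mean-square-error estimate E[y | x] of such a model. The component parameters, the names `gmm_map` and `linear_map`, and the use of a single global linear predictor as a stand-in for the multi-linear model are all hypothetical choices made for illustration; the abstract does not give the feature dimensions or the training procedure.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def gmm_map(x, components):
    """MMSE estimate E[y | x] under a joint GMM over (x, y).

    Each component is a dict with keys weight, mu_x, mu_y, var_x, cov_yx.
    """
    # Weighted likelihood of x under each component's marginal over x.
    likes = [c["weight"] * gaussian_pdf(x, c["mu_x"], c["var_x"]) for c in components]
    total = sum(likes)
    y_hat = 0.0
    for like, c in zip(likes, components):
        posterior = like / total  # p(component | x)
        # Local linear regression attached to this component.
        local = c["mu_y"] + c["cov_yx"] / c["var_x"] * (x - c["mu_x"])
        y_hat += posterior * local
    return y_hat


def linear_map(x, components):
    """Single global least-squares linear predictor built from the mixture's moments."""
    mu_x = sum(c["weight"] * c["mu_x"] for c in components)
    mu_y = sum(c["weight"] * c["mu_y"] for c in components)
    var_x = sum(c["weight"] * (c["var_x"] + (c["mu_x"] - mu_x) ** 2) for c in components)
    cov_yx = sum(
        c["weight"] * (c["cov_yx"] + (c["mu_x"] - mu_x) * (c["mu_y"] - mu_y))
        for c in components
    )
    return mu_y + cov_yx / var_x * (x - mu_x)


# Hypothetical model: y falls with x around x = -2 and rises with x around x = +2,
# so the overall (global) linear correlation between x and y is zero.
COMPONENTS = [
    {"weight": 0.5, "mu_x": -2.0, "mu_y": 2.0, "var_x": 1.0, "cov_yx": -1.0},
    {"weight": 0.5, "mu_x": 2.0, "mu_y": 2.0, "var_x": 1.0, "cov_yx": 1.0},
]

if __name__ == "__main__":
    for x in (-2.0, 0.0, 2.0):
        print(x, gmm_map(x, COMPONENTS), linear_map(x, COMPONENTS))
```

In this toy setting the global linear predictor returns the same value for every x because the overall covariance between x and y vanishes, while the GMM estimate follows the V-shaped relation by blending the two local regressions according to their posterior weights. That is the situation the abstract describes as weak linear correlation between the target and the predictor.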
The concept of using visual information as part of audio speech processing has been of significant r...
This paper presents recent developments on our “silent speech interface” that converts tongue and l...
In their everyday life, the speech recognition performance of human listeners is influenced by diver...
Cued Speech (CS) is a visual communication system that uses hand shapes placed in different position...
The objective of this work is to study the suitability of existing spectral mapping methods for enha...
Cued Speech (Langage Parlé Complété, LPC) is a visual communication system that uses hand shapes...
This thesis presents an exploratory research on the application of a nonlinear multiscale formalism,...
In this paper, a voice conversion approach that combines two distinct ideas is proposed to improve ...
The objective of this work is to represent the information in the speech signal picked up by a throa...
This article investigates the use of statistical mapping techniques for the co...
Note: This study aims to apply the Statistical Signal Mapping method to robust speech recognition. Us...
This paper examines the degree of correlation between lip and jaw configuration and speech acoustics. ...
This work begins by examining the correlation between audio and visual speech features and reveals h...