International audienceWe propose a computational model of speech production combining a pre-trained neural articulatory synthesizer able to reproduce complex speech stimuli from a limited set of interpretable articulatory parameters, a DNN-based internal forward model predicting the sensory consequences of articulatory commands, and an internal inverse model based on a recurrent neural network recovering articulatory commands from the acoustic speech input. Both forward and inverse models are jointly trained in a self-supervised way from raw acoustic-only speech data from different speakers. The imitation simulations are evaluated objectively and subjectively and display quite encouraging performances
Visual feedback of articulators using Electromagnetic- Articulography (EMA) has been shown to aid ac...
Generative deep neural networks are widely used for speech synthesis, but most existing models direc...
International audienceDeriving articulatory dynamics from the acoustic speech signal has been addres...
International audienceWe propose a computational model of speech production combining a pre-trained ...
Philippsen A, Reinhart F, Wrede B. Learning How to Speak: Imitation-Based Refinement of Syllable Pro...
Three neural network models were trained on the forward mapping from articulatory positions to acous...
Imitation is a powerful mechanism by which both animals and people can learn useful behavior, by cop...
The goal of our current project is to build a system that can learn to imitate a version of a spoken...
Articulatory copy synthesis (ACS), a subarea of speech inversion, refers to the reproduction of natu...
This paper describes a mapping problem that tests and validates the findings from our analytical ana...
Sensorimotor learning represents a challenging problem for natural and artificial systems. Several c...
This paper describes a model of speech production called DIVA that highlights issues of self-organiz...
A full 3D physiological articulatory model was developed to simulate the mechanism of human speech p...
Neural network (NN) applications have recently been employed to extract the parameters of an artic-u...
Representation learning is one of the fundamental issues in modeling articulatory-based speech synth...
Visual feedback of articulators using Electromagnetic- Articulography (EMA) has been shown to aid ac...
Generative deep neural networks are widely used for speech synthesis, but most existing models direc...
International audienceDeriving articulatory dynamics from the acoustic speech signal has been addres...
International audienceWe propose a computational model of speech production combining a pre-trained ...
Philippsen A, Reinhart F, Wrede B. Learning How to Speak: Imitation-Based Refinement of Syllable Pro...
Three neural network models were trained on the forward mapping from articulatory positions to acous...
Imitation is a powerful mechanism by which both animals and people can learn useful behavior, by cop...
The goal of our current project is to build a system that can learn to imitate a version of a spoken...
Articulatory copy synthesis (ACS), a subarea of speech inversion, refers to the reproduction of natu...
This paper describes a mapping problem that tests and validates the findings from our analytical ana...
Sensorimotor learning represents a challenging problem for natural and artificial systems. Several c...
This paper describes a model of speech production called DIVA that highlights issues of self-organiz...
A full 3D physiological articulatory model was developed to simulate the mechanism of human speech p...
Neural network (NN) applications have recently been employed to extract the parameters of an artic-u...
Representation learning is one of the fundamental issues in modeling articulatory-based speech synth...
Visual feedback of articulators using Electromagnetic- Articulography (EMA) has been shown to aid ac...
Generative deep neural networks are widely used for speech synthesis, but most existing models direc...
International audienceDeriving articulatory dynamics from the acoustic speech signal has been addres...