Representation learning is one of the fundamental issues in modeling articulatory-based speech synthesis using target-driven models. This paper proposes a computational strategy for learning underlying articulatory targets from a 3D articulatory speech synthesis model using a bi-directional long short-term memory recurrent neural network based on a small set of representative seed samples. From a seeding set, a larger training set was generated that provided richer contextual variations for the model to learn. The deep learning model for acoustic-to-target mapping was then trained to model the inverse relation of the articulation process. This method allows the trained model to map the given acoustic data onto the articulatory target parame...
We describe a neural based articulatory phonetic inversion model to improve the recognition of the a...
A new approach of recognizing vowels from articulatory position time-series data was proposed and te...
International audienceThe aim of this work is to develop an algorithm for controlling the articulato...
Philippsen A, Reinhart F, Wrede B. Learning How to Speak: Imitation-Based Refinement of Syllable Pro...
While the acoustic vowel space has been extensively studied in previous research, little is known ab...
International audienceWe propose a computational model of speech production combining a pre-trained ...
For Thai vowel pronunciation, it is very important to know that when mispronunciation occurs, the me...
Within the past decades advances in neural networks have improved the performance of a vast area of ...
Articulatory copy synthesis (ACS), a subarea of speech inversion, refers to the reproduction of natu...
International audienceIn this work, we address the prediction of speech articulators' temporal geome...
International audienceDeriving articulatory dynamics from the acoustic speech signal has been addres...
Three neural network models were trained on the forward mapping from articulatory positions to acous...
Generative deep neural networks are widely used for speech synthesis, but most existing models direc...
International audienceAcoustic simulations used in the articulatory synthesis of speech take a serie...
We address the problem of reconstructing articulatory movements, given audio and/or phonetic labels....
We describe a neural based articulatory phonetic inversion model to improve the recognition of the a...
A new approach of recognizing vowels from articulatory position time-series data was proposed and te...
International audienceThe aim of this work is to develop an algorithm for controlling the articulato...
Philippsen A, Reinhart F, Wrede B. Learning How to Speak: Imitation-Based Refinement of Syllable Pro...
While the acoustic vowel space has been extensively studied in previous research, little is known ab...
International audienceWe propose a computational model of speech production combining a pre-trained ...
For Thai vowel pronunciation, it is very important to know that when mispronunciation occurs, the me...
Within the past decades advances in neural networks have improved the performance of a vast area of ...
Articulatory copy synthesis (ACS), a subarea of speech inversion, refers to the reproduction of natu...
International audienceIn this work, we address the prediction of speech articulators' temporal geome...
International audienceDeriving articulatory dynamics from the acoustic speech signal has been addres...
Three neural network models were trained on the forward mapping from articulatory positions to acous...
Generative deep neural networks are widely used for speech synthesis, but most existing models direc...
International audienceAcoustic simulations used in the articulatory synthesis of speech take a serie...
We address the problem of reconstructing articulatory movements, given audio and/or phonetic labels....
We describe a neural based articulatory phonetic inversion model to improve the recognition of the a...
A new approach of recognizing vowels from articulatory position time-series data was proposed and te...
International audienceThe aim of this work is to develop an algorithm for controlling the articulato...