This thesis elaborates the use of speech production knowledge in the form of articulatory phonetic features to improve the robustness of speech recognition in practical situations. The main concept is that natural speech has three attributes in the human speech processing system, i.e., the motor activation, the articulatory trajectory, and the auditory perception. Consequently, the research work has three components. First, it describes an adaptive neural control model, which reproduces the articulatory trajectories and retrieves the motor activation patterns in a bio-mechanical speech synthesizer. Second, by manipulating the elastic vocal tract walls, the synthesizer produces the overall articulatory-to-acoustic trajectory map for English ...
The human speech apparatus is a rich source of information and offers many cues in the speech signal...
Kirchhoff K, Fink GA, Sagerer G. Combining acoustic and articulatory feature information for robust ...
We investigate the use of phonetic motor invariants (MIs), that is, recurring kinematic patterns of ...
We describe a neural based articulatory phonetic inversion model to improve the recognition of the a...
This paper presents a speech recognition technique based on inhibition/enhancement (In/En) of articu...
In this paper, we examined the feasibility of articulatory phonetic inversion (API) conditioned on t...
This book discusses the contribution of articulatory and excitation source information in discrimina...
Articulatory copy synthesis (ACS), a subarea of speech inversion, refers to the reproduction of natu...
We report on investigations, conducted at the 2006 JHU Summer Workshop, of the use of articulatory f...
State of the art articulatory synthesis is investigated through an examination of related parameters...
Speech recognition has become common in many application domains. Incorporating acoustic-phonetic kn...
We describe a speech recognition system which uses articulatory parameters as basic features and pho...
In this paper we investigate the use of articulatory data for speech recognition. Recordings of the ...
This paper presents a deep neural network (DNN) to extract articulatory information from the speech ...
The organization of a computational control model of articulatory speech synthesis is outlined in th...
The human speech apparatus is a rich source of information and offers many cues in the speech signal...
Kirchhoff K, Fink GA, Sagerer G. Combining acoustic and articulatory feature information for robust ...
We investigate the use of phonetic motor invariants (MIs), that is, recurring kinematic patterns of ...
We describe a neural based articulatory phonetic inversion model to improve the recognition of the a...
This paper presents a speech recognition technique based on inhibition/enhancement (In/En) of articu...
In this paper, we examined the feasibility of articulatory phonetic inversion (API) conditioned on t...
This book discusses the contribution of articulatory and excitation source information in discrimina...
Articulatory copy synthesis (ACS), a subarea of speech inversion, refers to the reproduction of natu...
We report on investigations, conducted at the 2006 JHU Summer Workshop, of the use of articulatory f...
State of the art articulatory synthesis is investigated through an examination of related parameters...
Speech recognition has become common in many application domains. Incorporating acoustic-phonetic kn...
We describe a speech recognition system which uses articulatory parameters as basic features and pho...
In this paper we investigate the use of articulatory data for speech recognition. Recordings of the ...
This paper presents a deep neural network (DNN) to extract articulatory information from the speech ...
The organization of a computational control model of articulatory speech synthesis is outlined in th...
The human speech apparatus is a rich source of information and offers many cues in the speech signal...
Kirchhoff K, Fink GA, Sagerer G. Combining acoustic and articulatory feature information for robust ...
We investigate the use of phonetic motor invariants (MIs), that is, recurring kinematic patterns of ...