Phonological-based features (articulatory features, AFs) describe the movements of the vocal organ which are shared across languages. This paper investigates a domain-adversarial neural network (DANN) to extract reliable AFs, and different multi-stream techniques are used for cross-lingual speech recognition. First, a novel universal phonological attributes definition is proposed for Mandarin, English, German and French. Then a DANN-based AFs detector is trained using source languages (English, German and French). When doing the cross-lingual speech recognition, the AFs detectors are used to transfer the phonological knowledge from source languages (English, German and French) to the target language (Mandarin). Two multi-stream approaches a...
Exploiting cross-lingual resources is an effective way to compensate for data scarcity of low resour...
We investigate multilingual modeling in the context of a deep neural network (DNN) – hidden Markov ...
Posterior-based or bottleneck features derived from neural net-works trained on out-of-domain data m...
Phonological-based features (articulatory features, AFs) describe the movements of the vocal organ w...
Articulatory features (AFs) provide language-independent attribute by exploiting the speech producti...
© 2014 IEEE. Speech signals are produced by the smooth and continuous movements of the human articul...
Automatic speech recognition requires many hours of transcribed speech recordings in order for an ac...
Articulatory features describe the way in which the speech organs are used when producing speech sou...
Multilingual speech recognition systems mostly benefit low resource languages but suffer degradation...
In this work, we propose several deep neural network architectures that are able to leverage data fr...
In recent years, the features derived from posteriors of a multilayer perceptron (MLP), known as tan...
Multilingual automatic speech recognition (ASR) systems mostly benefit low resource languages but su...
The use of articulatory features, such as place and manner of articulation, has been shown to reduce...
Deep neural network (DNN) acoustic models can be adapted to under-resourced languages by transferrin...
The use of articulatory features, such as place and manner of articulation, has been shown to reduce...
Exploiting cross-lingual resources is an effective way to compensate for data scarcity of low resour...
We investigate multilingual modeling in the context of a deep neural network (DNN) – hidden Markov ...
Posterior-based or bottleneck features derived from neural net-works trained on out-of-domain data m...
Phonological-based features (articulatory features, AFs) describe the movements of the vocal organ w...
Articulatory features (AFs) provide language-independent attribute by exploiting the speech producti...
© 2014 IEEE. Speech signals are produced by the smooth and continuous movements of the human articul...
Automatic speech recognition requires many hours of transcribed speech recordings in order for an ac...
Articulatory features describe the way in which the speech organs are used when producing speech sou...
Multilingual speech recognition systems mostly benefit low resource languages but suffer degradation...
In this work, we propose several deep neural network architectures that are able to leverage data fr...
In recent years, the features derived from posteriors of a multilayer perceptron (MLP), known as tan...
Multilingual automatic speech recognition (ASR) systems mostly benefit low resource languages but su...
The use of articulatory features, such as place and manner of articulation, has been shown to reduce...
Deep neural network (DNN) acoustic models can be adapted to under-resourced languages by transferrin...
The use of articulatory features, such as place and manner of articulation, has been shown to reduce...
Exploiting cross-lingual resources is an effective way to compensate for data scarcity of low resour...
We investigate multilingual modeling in the context of a deep neural network (DNN) – hidden Markov ...
Posterior-based or bottleneck features derived from neural net-works trained on out-of-domain data m...