In this paper we describe an approach that both creates crosslingual acoustic monophone model sets for speech recognition tasks and objectively predicts their performance without target-language speech data or acoustic measurement techniques. This strategy is based on a series of linguistic metrics characterizing the articulatory phonetic and phonological distances of target-language phonemes from source-language phonemes. We term these algorithms the Combined Phonetic and Phonological Crosslingual Distance (CPP-CD) metric and the Combined Phonetic and Phonological Crosslingual Prediction (CPP-CP) metric. The particular motivations for this project are the current unavailability and often prohibitively high production cost of speech databas...
Only a handful of the world’s languages are abundant with the resources that enable practical applic...
AFIT/GE/ENG/06-62 Current speech recognition systems tend to be developed only for commercially viab...
The idea of combining multiple languages’ recordings to train a single automatic speech recognition ...
Multilingual speech recognition systems mostly benefit low resource languages but suffer degradation...
The development of a speech recognition system requires at least three resources: a large labeled sp...
Automatic speech recognition requires many hours of transcribed speech recordings in order for an ac...
Cross-language speech recognition often assumes a certain amount of knowledge about the target langu...
This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recogni...
Multilingual automatic speech recognition (ASR) systems mostly benefit low resource languages but su...
A state-of-the-art automatic speech recognition (ASR) system can often achieve high accuracy for mos...
Exploiting cross-lingual resources is an effective way to compensate for data scarcity of low resour...
In recent times, the improved levels of accuracy obtained by Automatic Speech Recognition (ASR) tech...
Over the past decades, speech recognition has dramatically improved in a large variety of applicatio...
A method is proposed for implementing the cross-lingual porting of recognition models for rapid prot...
This paper presents the work on crosslingual speech recognition carried out by the MASPER initiative...
Only a handful of the world’s languages are abundant with the resources that enable practical applic...
AFIT/GE/ENG/06-62 Current speech recognition systems tend to be developed only for commercially viab...
The idea of combining multiple languages’ recordings to train a single automatic speech recognition ...
Multilingual speech recognition systems mostly benefit low resource languages but suffer degradation...
The development of a speech recognition system requires at least three resources: a large labeled sp...
Automatic speech recognition requires many hours of transcribed speech recordings in order for an ac...
Cross-language speech recognition often assumes a certain amount of knowledge about the target langu...
This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recogni...
Multilingual automatic speech recognition (ASR) systems mostly benefit low resource languages but su...
A state-of-the-art automatic speech recognition (ASR) system can often achieve high accuracy for mos...
Exploiting cross-lingual resources is an effective way to compensate for data scarcity of low resour...
In recent times, the improved levels of accuracy obtained by Automatic Speech Recognition (ASR) tech...
Over the past decades, speech recognition has dramatically improved in a large variety of applicatio...
A method is proposed for implementing the cross-lingual porting of recognition models for rapid prot...
This paper presents the work on crosslingual speech recognition carried out by the MASPER initiative...
Only a handful of the world’s languages are abundant with the resources that enable practical applic...
AFIT/GE/ENG/06-62 Current speech recognition systems tend to be developed only for commercially viab...
The idea of combining multiple languages’ recordings to train a single automatic speech recognition ...