In the spoken language translation pipeline, machine translation systems that are trained solely on written bitexts are often unable to recover from speech recognition errors due to the mismatch in training data. We propose a novel technique to simulate the errors generated by an ASR system, using the ASR system’s pronunciation dictionary and language model. Lexical entries in the pronunciation dictionary are converted into phoneme sequences using a text-to-speech (TTS) analyzer and stored in a phoneme-to-word translation model. The translation model and ASR language model are combined into a phoneme-to-word MT system that “damages” clean texts to look like ASR outputs based on acoustic confusions. Training texts are TTS-converted and damag...
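The damaging step described above can be illustrated with a minimal sketch. It is not the authors' system: the lexicon, the phoneme symbols, and the unigram probabilities below are hypothetical toy values, a unigram model stands in for the ASR language model, and an exact-match lookup table stands in for the phoneme-to-word translation model. The sketch only shows how re-decoding a phoneme sequence under a language model can turn clean text into a plausible misrecognition.

```python
import math

# Hypothetical toy pronunciation dictionary (word -> phoneme tuple).
# In the abstract these sequences come from a TTS analyzer.
LEXICON = {
    "i": ("AY",),
    "eye": ("AY",),
    "scream": ("S", "K", "R", "IY", "M"),
    "ice": ("AY", "S"),
    "cream": ("K", "R", "IY", "M"),
}

# Hypothetical unigram probabilities standing in for the ASR language model.
UNIGRAM = {"i": 0.4, "eye": 0.05, "scream": 0.3, "ice": 0.1, "cream": 0.15}


def to_phonemes(words):
    """Concatenate the phonemes of each word (raises KeyError on unknown words)."""
    phones = []
    for word in words:
        phones.extend(LEXICON[word])
    return phones


def build_phoneme_table(lexicon):
    """Invert the lexicon: phoneme tuple -> list of words pronounced that way."""
    table = {}
    for word, phones in lexicon.items():
        table.setdefault(phones, []).append(word)
    return table


def decode(phones, table, unigram):
    """Find the word sequence covering `phones` with the highest unigram log-probability."""
    n = len(phones)
    # best[k] = (score, start of last word, last word) for the prefix phones[:k]
    best = [(-math.inf, None, None)] * (n + 1)
    best[0] = (0.0, None, None)
    max_len = max(len(key) for key in table)
    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            if best[start][0] == -math.inf:
                continue
            for word in table.get(tuple(phones[start:end]), []):
                score = best[start][0] + math.log(unigram[word])
                if score > best[end][0]:
                    best[end] = (score, start, word)
    if best[n][0] == -math.inf:
        return None  # no segmentation into known words
    words = []
    pos = n
    while pos > 0:
        _, start, word = best[pos]
        words.append(word)
        pos = start
    return words[::-1]


def damage(text):
    """Convert clean text to phonemes, then decode it back into words."""
    table = build_phoneme_table(LEXICON)
    return " ".join(decode(to_phonemes(text.split()), table, UNIGRAM))
```

With these toy values, `damage("ice cream")` returns `"i scream"`, because both strings share one phoneme sequence and the stand-in language model prefers the second segmentation. A fuller simulation would also need substitutions between acoustically similar phonemes, which this sketch omits.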
Neural machine translation models have been shown to achieve high quality when trained and fed with well ...
Text-to-speech (TTS) systems are built on speech corpora which are labeled wit...
We propose a novel technique for adapting text-based statistical machine translation to deal with...
Spoken language translation (SLT) exists within one of the most challenging intersections of speech ...
Error propagation from automatic speech recognition (ASR) to machine translation (MT) is a critical ...
Speech-to-speech translation is a challenging task mixing two of the most ambitious Natural Language...
We report insights from translating Spanish conversational telephone speech into English text by cas...
In spoken language translation, integration of the ASR and MT components is critical for good perfor...
Pelemans J., Vanallemeersch T., Demuynck K., Verwimp L., Van hamme H., Wambacq P., ''Language model ...
This thesis addresses the problems of phonemic variability and confusability from the pronunciation ...
For automatic speech translation (AST), end-to-end approaches are outperformed by cascaded models th...
Rapid deployment of automatic speech recognition (ASR) in new languages, with very limited data, is ...
Some practical uses of ASR have been implemented, including the transcription of meetings and the us...