Code-switching (CS) is the phenomenon in which a speaker mixes two or more languages within the same conversation. Handling CS is challenging for automatic speech recognition (ASR) and text-to-speech (TTS) because both must cope with multilingual input. Although CS text or speech can be found on social media, paired datasets of CS speech and the corresponding transcriptions, which supervised training requires, are hard to obtain. This work adopts a deep learning-based machine speech chain in which CS ASR and CS TTS train each other through semi-supervised learning. After supervised learning on monolingual data, the machine speech chain continues with unsupervised learning on either CS text or CS speech. The resu...
The study of code-switching (CS) speech has produced a wealth of knowledge in the understanding of b...
Self-supervised learning from raw speech has been proven beneficial to improve...
We propose a) a Language Agnostic end-to-end Speech Translation model (LAST), and b) a data augmenta...
Code-switching (CS) in spoken language occurs when the speech contains two or more languages within an utter...
A recurrent neural network (RNN)-based attention model has been used in code-switching speech recog...
Code switching (CS) is a natural phenomenon that is often observed in multilingual speakers. These ...
Code-switching is the alternation between languages in the communication process. Training end-to-end (E2E) ...
Generally the existing monolingual corpora are not suitable for large vocabulary continuous speech r...
The thesis is a replication of the work by Takaaki Hori and his colleagues (2019), which introduces ...
This work explores multilingual speech synthesis. We compare three models based on Tacotron that uti...
This paper describes the integration of language identification (LID) into a multilingual automatic ...
This paper presents first steps toward a large vocabulary continuous speech recognition system (LVCS...
In this study, we present improvements in N-best rescoring of code-switched speech achieved by n-gra...
In this article, we propose a simple yet effective approach to train an end-to-end speech recognitio...
Code-Switching (CSW) is a common phenomenon that occurs in multilingual geographic or social context...