Speakers in multilingual communities often switch between or mix multiple languages in the same conversation. Automatic Speech Recognition (ASR) of code-switched speech faces many challenges, including the influence of phones of different languages on each other. This paper shows evidence that phone sharing between languages improves acoustic model performance for Hindi-English code-switched speech. We compare a baseline system built with separate phones for Hindi and English with systems in which the phones were manually merged based on linguistic knowledge. Encouraged by the improved ASR performance after manually merging the phones, we further investigate multiple data-driven methods to identify phones to be merged a...
In this paper, phone-to-word transduction is first investigated by coupling a speech recognizer, ge...
Many studies have explored the use of existing multilingual speech corpora to build an...
We propose a) a Language Agnostic end-to-end Speech Translation model (LAST), and b) a data augmenta...
Only a handful of the world’s languages are abundant with the resources that enable practical applic...
Code-switching (CS) in spoken language is where the speech has two or more languages within an utter...
In this paper, phoneme sequences are used as language information to perform code-switched...
Training multilingual automatic speech recognition (ASR) systems is challenging because acoustic and...
This paper describes the integration of language identification (LID) into a multilingual a...
A state-of-the-art automatic speech recognition (ASR) system can often achieve high accuracy for mos...
Recent methods in speech and language technology pretrain very large models which are fine-tuned for...
Code switching (CS) is a natural phenomenon that is often observed in multilingual speakers. These ...
Languages in Malaysia are dying at an alarming rate. As of today, 15 languages are in danger while t...
In recent times, the improved levels of accuracy obtained by Automatic Speech Recognition (ASR) tech...
A data-driven computational approach is adopted to reveal significant pronunciation variations in Ca...
Code switching (the process of switching from one language to another during a conversation) is a co...