ABSTRACT This paper describes the integration of language identification (LID) into a multilingual automatic speech recognition (ASR) system for spoken conversations containing code-switches between Mandarin and English. We apply a multistream approach to combine at frame level the acoustic model score and the language information, where the latter is provided by an LID component. Furthermore, we advance this multistream approach by a new method called "Language Lookahead", in which the language information of subsequent frames is used to improve accuracy. Both methods are evaluated using a set of controlled LID results with varying frame accuracies. Our results show that both approaches improve the ASR performance by at least 4% ...
The bi-encoder structure has been intensively investigated in code-switching (CS) automatic speech r...
Training multilingual automatic speech recognition (ASR) systems is challenging because acoustic and...
This master's thesis deals with code-switching detection in speech. The state-of-the-art methods of ...
<p>This paper describes the integration of language identification (LID) into a multilingual automat...
Abstract: In this paper, phoneme sequences are used as language information to perform code-switched...
The process of determining the language of a speech utterance is called Language Identification (LID...
This paper presents first steps toward a large vocabulary continuous speech recognition system (LVCS...
Code-switching (CS) in spoken language is where the speech has two or more languages within an utter...
Spoken language identification (LID) refers to the automatic process of determining the identity of ...
International audienceSpeakers in multilingual communities often switch between or mix multiple lang...
Code-switching deals with alternative languages in communication process. Training end-to-end (E2E) ...
Automatic spoken Language Identi¯cation (LID) is the process of identifying the language spoken with...
We present two concepts for systems with language identification in the context of multilingual info...
A data-driven computational approach is adopted to reveal significant pronunciation variations in Ca...
Articulatory features (AFs) provide language-independent attribute by exploiting the speech producti...
The bi-encoder structure has been intensively investigated in code-switching (CS) automatic speech r...
Training multilingual automatic speech recognition (ASR) systems is challenging because acoustic and...
This master's thesis deals with code-switching detection in speech. The state-of-the-art methods of ...
<p>This paper describes the integration of language identification (LID) into a multilingual automat...
Abstract: In this paper, phoneme sequences are used as language information to perform code-switched...
The process of determining the language of a speech utterance is called Language Identification (LID...
This paper presents first steps toward a large vocabulary continuous speech recognition system (LVCS...
Code-switching (CS) in spoken language is where the speech has two or more languages within an utter...
Spoken language identification (LID) refers to the automatic process of determining the identity of ...
International audienceSpeakers in multilingual communities often switch between or mix multiple lang...
Code-switching deals with alternative languages in communication process. Training end-to-end (E2E) ...
Automatic spoken Language Identi¯cation (LID) is the process of identifying the language spoken with...
We present two concepts for systems with language identification in the context of multilingual info...
A data-driven computational approach is adopted to reveal significant pronunciation variations in Ca...
Articulatory features (AFs) provide language-independent attribute by exploiting the speech producti...
The bi-encoder structure has been intensively investigated in code-switching (CS) automatic speech r...
Training multilingual automatic speech recognition (ASR) systems is challenging because acoustic and...
This master's thesis deals with code-switching detection in speech. The state-of-the-art methods of ...