Nowadays, the interest in code-mixing has become ubiquitous in Natural Language Processing (NLP); however, not much attention has been given to address this phenomenon for Speech Translation (ST) task. This can be solely attributed to the lack of code-mixed ST task labelled data. Thus, we introduce Prabhupadavani, which is a multilingual code-mixed ST dataset for 25 languages. It is multi-domain, covers ten language families, containing 94 hours of speech by 130+ speakers, manually aligned with corresponding text in the target language. The Prabhupadavani is about Vedic culture and heritage from Indic literature, where code-switching in the case of quotation from literature is important in the context of humanities teaching. To the best of ...
Abstract—Machine Translation pertains to translation of one natural language to other by using autom...
Code-mixing or language-mixing is a linguistic phenomenon where multiple language mix together durin...
In the present communication-based society, no natural language seems to have been left untouched by...
Code-mixing is the phenomenon of using more than one language in a sentence. It is a very frequently...
This paper describes the development of a multilingual, manually annotated dataset for three under-r...
Abstract: The Dravidian languages are spoken all over the world. Despite their distinctiveness, Drav...
In recent years, the multilingual content over the internet has grown exponentially together with th...
This paper describes the development of a multilingual, manually annotated dataset for three under-r...
Training multilingual automatic speech recognition (ASR) systems is challenging because acoustic and...
Linguistic code switching (LCS) occurs when speakers mix multiple languages in the same speech utter...
The analysis of data in which multiple languages are represented has gained popularity among computa...
Multimodal machine translation is the task of translating from a source text into the target langu...
This Natural Langauge processing is carried particularly on English-Kannada/Telugu. Kannada is a lan...
HinDialect: 26 Hindi-related languages and dialects of the Indic Continuum in North India Languag...
Multimodal machine translation is the task of translating from source language to target language us...
Abstract—Machine Translation pertains to translation of one natural language to other by using autom...
Code-mixing or language-mixing is a linguistic phenomenon where multiple language mix together durin...
In the present communication-based society, no natural language seems to have been left untouched by...
Code-mixing is the phenomenon of using more than one language in a sentence. It is a very frequently...
This paper describes the development of a multilingual, manually annotated dataset for three under-r...
Abstract: The Dravidian languages are spoken all over the world. Despite their distinctiveness, Drav...
In recent years, the multilingual content over the internet has grown exponentially together with th...
This paper describes the development of a multilingual, manually annotated dataset for three under-r...
Training multilingual automatic speech recognition (ASR) systems is challenging because acoustic and...
Linguistic code switching (LCS) occurs when speakers mix multiple languages in the same speech utter...
The analysis of data in which multiple languages are represented has gained popularity among computa...
Multimodal machine translation is the task of translating from a source text into the target langu...
This Natural Langauge processing is carried particularly on English-Kannada/Telugu. Kannada is a lan...
HinDialect: 26 Hindi-related languages and dialects of the Indic Continuum in North India Languag...
Multimodal machine translation is the task of translating from source language to target language us...
Abstract—Machine Translation pertains to translation of one natural language to other by using autom...
Code-mixing or language-mixing is a linguistic phenomenon where multiple language mix together durin...
In the present communication-based society, no natural language seems to have been left untouched by...