Dravidian languages, such as Kannada and Tamil, are notoriously difficult for state-of-the-art neural models to translate. This is because these languages are both morphologically very rich and low-resourced. In this paper, we focus on subword segmentation and evaluate Linguistically Motivated Vocabulary Reduction (LMVR) against the more commonly used SentencePiece (SP) for the task of translating from English into four different Dravidian languages. Additionally, we investigate the optimal subword vocabulary size for each language. We find that SP is the overall best choice for segmentation, and that larger dictionary sizes lead to higher translation quality.
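To make the role of subword vocabulary size concrete, the following is a minimal sketch of byte-pair encoding (BPE), one of the segmentation algorithms that SentencePiece supports. It is not the paper's setup and it is not LMVR: the toy corpus, the function names (`learn_bpe`, `merge_pair`, `segment`) and the merge counts are all assumptions chosen for illustration. The number of merges stands in for the vocabulary size: more merges yield a larger vocabulary and longer subword units.

```python
from collections import Counter


def merge_pair(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with one merged symbol."""
    out, i = [], 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def learn_bpe(words, num_merges):
    """Learn up to `num_merges` merge rules from a list of word tokens."""
    # Each word starts as a sequence of characters plus an end-of-word marker.
    vocab = Counter(tuple(w) + ("</w>",) for w in words)
    merges = []
    for _ in range(num_merges):
        # Count how often each adjacent symbol pair occurs, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break  # every word is already a single symbol
        # Pick the most frequent pair; ties are broken by the pair itself for determinism.
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            new_vocab[merge_pair(symbols, best)] += freq
        vocab = new_vocab
    return merges


def segment(word, merges):
    """Segment a word by replaying the learned merges in order."""
    symbols = tuple(word) + ("</w>",)
    for pair in merges:
        symbols = merge_pair(symbols, pair)
    return list(symbols)


# Hypothetical toy corpus; a real experiment would use the training side of a parallel corpus.
corpus = ["low"] * 5 + ["lower"] * 2 + ["newest"] * 6 + ["widest"] * 3

if __name__ == "__main__":
    for num_merges in (0, 4, 12):
        merges = learn_bpe(corpus, num_merges)
        print(num_merges, segment("lowest", merges))
```

With zero merges every word falls apart into characters; as the merge budget grows, frequent character sequences become single units, so the same word is covered by fewer, longer pieces. Choosing the subword vocabulary size is a choice along this axis.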
In this proposed work, word re-ordering rules for English to Dravidian languages Machine Translation...
Abstract: The Dravidian languages are spoken all over the world. Despite their distinctiveness, Drav...
This Natural Language Processing is carried particularly on English-Kannada/Telugu. Kannada is a lan...
This paper describes our submission for the English-Tamil news translation task of WMT-2020. The var...
Unsupervised Neural Machine translation (UNMT) is beneficial especially for under-resourced languag...
Neural machine translation (NMT) models are conventionally trained with fixed-size vocabularies to...
Wordnets are extensively used in natural language processing, but the current approaches for manual...
Myanmar sentences are written as contiguous sequences of syllables with no characters delimiting thew...
Machine translation is the process of translating a document from one language to another with the a...
Natural Language Processing is a field of computer science, AI and linguistics concerned with the in...
Machine Translation bridges communication barriers and eases interaction among people having differe...
Subword segmenters like BPE operate as a pre-processing step in neural machine translation and othe...