The use of multilingual language models for tasks in low and high-resource languages has been a success story in deep learning. In recent times, Arabic has been receiving widespread attention on account of its dialectal variance. While prior research studies have tried to adapt these multilingual models for dialectal variants of Arabic, it still remains a challenging problem owing to the lack of sufficient monolingual dialectal data and parallel translation data of such dialectal variants. It remains an open problem on whether the limited dialectical data can be used to improve the models trained in Arabic on its dialectal variants. First, we show that multilingual-BERT (mBERT) incrementally pretrained on Arabic monolingual data takes less ...
A major problem with dialectal Arabic speech recognition is due to the sparsity of speech resources....
In this research article, we study the problem of employing a neural machine translation model to tr...
We study the effectiveness of recently developed language recognition techniques based on speech rec...
This thesis discusses different approaches to machine translation (MT) from Dialectal Arabic (DA) to...
International audienceNeural Machine Translation (NMT) systems have been shown to perform impressive...
University of Technology Sydney. Faculty of Engineering and Information Technology.Rapid growth in s...
There is a growing body of work in recent years to develop pre-trained language models (PLMs) for th...
In this paper, we present a Dialect Identification system (ArbDialectID) that competed at Task 1 of ...
Arabic dialect classification has been an important and challenging problem for Arabic language proc...
This thesis has two aims: developing resources for Arabic dialects and improving the speech recognit...
Arabic is a Semitic language which is widely spoken with many dialects. Given the success of pre-tra...
In this paper, we study the effect of different word-level preprocessing decisions for Arabic on SMT...
International audienceCreating parallel corpora is a difficult issue that many researches try to dea...
This paper addresses the classification of Arabic text data in the field of Natural Language Process...
Pretraining data The models were pretrained on ~4.4 Billion words: Arabic version of OSCAR (unsh...
A major problem with dialectal Arabic speech recognition is due to the sparsity of speech resources....
In this research article, we study the problem of employing a neural machine translation model to tr...
We study the effectiveness of recently developed language recognition techniques based on speech rec...
This thesis discusses different approaches to machine translation (MT) from Dialectal Arabic (DA) to...
International audienceNeural Machine Translation (NMT) systems have been shown to perform impressive...
University of Technology Sydney. Faculty of Engineering and Information Technology.Rapid growth in s...
There is a growing body of work in recent years to develop pre-trained language models (PLMs) for th...
In this paper, we present a Dialect Identification system (ArbDialectID) that competed at Task 1 of ...
Arabic dialect classification has been an important and challenging problem for Arabic language proc...
This thesis has two aims: developing resources for Arabic dialects and improving the speech recognit...
Arabic is a Semitic language which is widely spoken with many dialects. Given the success of pre-tra...
In this paper, we study the effect of different word-level preprocessing decisions for Arabic on SMT...
International audienceCreating parallel corpora is a difficult issue that many researches try to dea...
This paper addresses the classification of Arabic text data in the field of Natural Language Process...
Pretraining data The models were pretrained on ~4.4 Billion words: Arabic version of OSCAR (unsh...
A major problem with dialectal Arabic speech recognition is due to the sparsity of speech resources....
In this research article, we study the problem of employing a neural machine translation model to tr...
We study the effectiveness of recently developed language recognition techniques based on speech rec...