Arabic dialect classification has been an important and challenging problem for Arabic language processing, especially for social media text analysis and machine translation. In this paper we propose an approach to improving Arabic dialect classification with semi-supervised learn-ing: multiple classifiers are trained with weakly supervised, strongly supervised, and unsupervised data. Their combination yields significant and consistent improve-ment on two different test sets. The dialect classification accuracy is improved by 5% over the strongly supervised classifier and 20 % over the weakly supervised classifier. Furthermore, when applying the improved dialect classifier to build a Modern Stan-dard Arabic (MSA) language model (LM), the ne...
Modern Standard Arabic (MSA) is the formal language in most Arabic countries. Arabic Dia-lects (AD) ...
In this paper, we present our approach for profiling Arabic authors on Twitter, based on their tweet...
International audienceNeural Machine Translation (NMT) systems have been shown to perform impressive...
The paper presents work on improved sentence-level dialect classification of Egyptian Arabic (ARZ) v...
We study the effectiveness of recently developed language recognition techniques based on speech rec...
Speech dialects refer to linguistic and pronunciation variations in the speech of the same language....
This study investigates how to classify Arabic dialects in text by extracting features which show th...
We present a study on sentence-level Arabic Dialect Identification using the newly developed Multidi...
Modern Standard Arabic is the written standard across the Arab world; but there is an increasing use...
This paper describes an Arabic dialect identification system which we developed for the Discriminati...
Automatic Language Identification (ALI) is the first necessary step to do any language-dependent nat...
Recent computational work on Arabic dialect identification has focused primarily on building and ann...
A major problem with dialectal Arabic speech recognition is due to the sparsity of speech resources....
Several recent papers on Arabic dialect identi-fication have hinted that using a word unigram model ...
Novel Techniques for Dialectal Arabic Speech describes approaches to improve automatic speech recogn...
Modern Standard Arabic (MSA) is the formal language in most Arabic countries. Arabic Dia-lects (AD) ...
In this paper, we present our approach for profiling Arabic authors on Twitter, based on their tweet...
International audienceNeural Machine Translation (NMT) systems have been shown to perform impressive...
The paper presents work on improved sentence-level dialect classification of Egyptian Arabic (ARZ) v...
We study the effectiveness of recently developed language recognition techniques based on speech rec...
Speech dialects refer to linguistic and pronunciation variations in the speech of the same language....
This study investigates how to classify Arabic dialects in text by extracting features which show th...
We present a study on sentence-level Arabic Dialect Identification using the newly developed Multidi...
Modern Standard Arabic is the written standard across the Arab world; but there is an increasing use...
This paper describes an Arabic dialect identification system which we developed for the Discriminati...
Automatic Language Identification (ALI) is the first necessary step to do any language-dependent nat...
Recent computational work on Arabic dialect identification has focused primarily on building and ann...
A major problem with dialectal Arabic speech recognition is due to the sparsity of speech resources....
Several recent papers on Arabic dialect identi-fication have hinted that using a word unigram model ...
Novel Techniques for Dialectal Arabic Speech describes approaches to improve automatic speech recogn...
Modern Standard Arabic (MSA) is the formal language in most Arabic countries. Arabic Dia-lects (AD) ...
In this paper, we present our approach for profiling Arabic authors on Twitter, based on their tweet...
International audienceNeural Machine Translation (NMT) systems have been shown to perform impressive...