Abstract. Current Arabic lexicons, whether computational or otherwise, make no distinction between entries from Modern Standard Arabic (MSA) and Classical Arabic (CA), and tend to include obsolete words that are not attested in current usage. We address this problem by building a large-scale, corpus-based lexical database that is representative of MSA. We use an MSA corpus of 1,089,111,204 words, a pre-annotation tool, machine learning techniques, and knowledge-based templatic matching to automatically acquire and filter lexical knowledge about morpho-syntactic attributes and inflection paradigms. Our lexical database is scalable, interoperable and suitable for constructing a morphological analyser, regardless of the design approach and ...
Unknown words, or out-of-vocabulary (OOV) words, cause a significant problem for morphological analys...
Broad-coverage language resources which provide prior linguistic knowledge must improve the accuracy...
We describe the generation of an Arabic full-form lexicon and its conversion into a two-level Finite...
We develop an open-source large-scale finite-state morphological processing toolkit (AraComLex) for M...
We provide lexical profiling for Arabic by covering two important linguistic aspects of Arabic lexic...
This article describes the construction of a lexicon and a morphological description for standard Ar...
We describe a lexicon of Arabic verbs constructed on the basis of Semitic patt...
Applications of statistical Arabic NLP in general, and text mining in particular, along with the tools...
There has been extensive work on Arabic morphology, lexicography and syntax resulting in many resou...
We present an XML approach for the production of an Arabic morphological database for Arabic languag...