Persian with its about 100,000,000 speakers in the world belongs to the group of languages with less developed linguistically annotated resources and tools. The few existing resources and tools are neither open source nor freely available. Thus, our goal is to develop open source resources such as corpora and treebanks, and tools for data-driven linguistic analysis of Persian. We do this by exploring the reusability of existing resources and adapting state-of-the-art methods for the linguistic annotation. We present fully functional tools for text normalization, sentence segmentation, tokenization, part-of-speech tagging, and parsing. As for resources, we describe the Uppsala PErsian Corpus (UPEC) which is a modified version of the Bijankha...
Vocabulary building of a foreign language is one of the difficult tasks for non-native learners and ...
AbstractIn this paper we present and justify methodological principles and syntactic criteria to des...
We present two dependency parsers for Persian, MaltParser and MSTParser, trained on theUppsala PErsi...
Persian with its about 100,000,000 speakers in the world belongs to the group of languages with less...
We present the Uppsala Persian Dependency Treebank (UPDT) with a syntactic annotation scheme based o...
This thesis presents open source resources in the form of annotated corpora and modules for automati...
The Persian Universal Dependency Treebank (Persian UD) is a recent effort of treebanking Persian wit...
Currently, most linguistic studies benefit from valid linguistic data available at corpora. Compilin...
One of the primary tools used in text processing tasks such as information retrieval, text extractio...
Collocation is an important level of lexis and its significance in the pedagogy of a language is an ...
Collocation is an important level of lexis and its significance in the pedagogy of a language is an ...
AbstractIn this paper we present and justify methodological principles and syntactic criteria to des...
Persian has two distinct variants, the literary formal form that has been traditionally used in writ...
Abstract—In this paper we introduce a method for part-of-speech disambiguation of Persian texts, whi...
Vocabulary building of a foreign language is one of the difficult tasks for non-native learners and ...
Vocabulary building of a foreign language is one of the difficult tasks for non-native learners and ...
AbstractIn this paper we present and justify methodological principles and syntactic criteria to des...
We present two dependency parsers for Persian, MaltParser and MSTParser, trained on theUppsala PErsi...
Persian with its about 100,000,000 speakers in the world belongs to the group of languages with less...
We present the Uppsala Persian Dependency Treebank (UPDT) with a syntactic annotation scheme based o...
This thesis presents open source resources in the form of annotated corpora and modules for automati...
The Persian Universal Dependency Treebank (Persian UD) is a recent effort of treebanking Persian wit...
Currently, most linguistic studies benefit from valid linguistic data available at corpora. Compilin...
One of the primary tools used in text processing tasks such as information retrieval, text extractio...
Collocation is an important level of lexis and its significance in the pedagogy of a language is an ...
Collocation is an important level of lexis and its significance in the pedagogy of a language is an ...
AbstractIn this paper we present and justify methodological principles and syntactic criteria to des...
Persian has two distinct variants, the literary formal form that has been traditionally used in writ...
Abstract—In this paper we introduce a method for part-of-speech disambiguation of Persian texts, whi...
Vocabulary building of a foreign language is one of the difficult tasks for non-native learners and ...
Vocabulary building of a foreign language is one of the difficult tasks for non-native learners and ...
AbstractIn this paper we present and justify methodological principles and syntactic criteria to des...
We present two dependency parsers for Persian, MaltParser and MSTParser, trained on theUppsala PErsi...