Abstract. This paper describes our research on statistical language modeling of Lithuanian. The idea of improving sparse n-gram models of highly inflected Lithuanian language by interpolating them with complex n-gram models based on word clustering and morphological word decompo-sition was investigated. Words, word base forms and part-of-speech tags were clustered into 50 to 5000 automatically generated classes. Multiple 3-gram and 4-gram class-based language models were built and evaluated on Lithuanian text corpus, which contained 85 million words. Class-based models linearly interpolated with the 3-gram model led up to a 13 % reduction in the perplexity compared with the baseline 3-gram model. Morphological models decreased out-of-vocabu...
We study class-based n-gram and neural network language models for very large vocabulary speech reco...
This paper investigates a variety of statistical cache-based language models built upon three corpor...
The article presents a brief overview of studies in the field of computational morphology in Latvian...
This paper describes our research on statistical language modeling of Lithuanian. The idea of improv...
This paper presents state of the art language modeling (LM) of Lithuanian, which is highly inflected...
As the development of information technologies makes progress, large morphologically annotated corpo...
This paper deals with the usage of parts of speech and their grammatical features in the morphologic...
It is well known that good language models improve performance of speech recognition. One requiremen...
We present the first statistical dependency parsing results for Lithuanian, a morphologically rich l...
We describe an approach for morphological analysis combining a rule-based word level morphological a...
The article presents a brief overview of studies in the field of computational morphology in Latvian...
Abstract. This paper investigates a variety of statistical cache-based language models built upon th...
The paper deals with the preliminary findings from the morphologically annotated corpus of Lithuania...
The methods that have been used for solving disambiguation of morphological ambiguity of Lithuanian ...
We describe methods for disambiguation of Lithuanian morphological ambiguity. The methods we present...
We study class-based n-gram and neural network language models for very large vocabulary speech reco...
This paper investigates a variety of statistical cache-based language models built upon three corpor...
The article presents a brief overview of studies in the field of computational morphology in Latvian...
This paper describes our research on statistical language modeling of Lithuanian. The idea of improv...
This paper presents state of the art language modeling (LM) of Lithuanian, which is highly inflected...
As the development of information technologies makes progress, large morphologically annotated corpo...
This paper deals with the usage of parts of speech and their grammatical features in the morphologic...
It is well known that good language models improve performance of speech recognition. One requiremen...
We present the first statistical dependency parsing results for Lithuanian, a morphologically rich l...
We describe an approach for morphological analysis combining a rule-based word level morphological a...
The article presents a brief overview of studies in the field of computational morphology in Latvian...
Abstract. This paper investigates a variety of statistical cache-based language models built upon th...
The paper deals with the preliminary findings from the morphologically annotated corpus of Lithuania...
The methods that have been used for solving disambiguation of morphological ambiguity of Lithuanian ...
We describe methods for disambiguation of Lithuanian morphological ambiguity. The methods we present...
We study class-based n-gram and neural network language models for very large vocabulary speech reco...
This paper investigates a variety of statistical cache-based language models built upon three corpor...
The article presents a brief overview of studies in the field of computational morphology in Latvian...