Abstract. This paper investigates a variety of statistical cache-based language models built upon three corpora: English, Lithuanian, and Lithuanian base forms. The impact of the cache size, type of the decay function, including custom corpus derived functions, and interpolation technique (static vs. dynamic) on the perplexity of a language model is studied. The best results are achieved by models consisting of 3 components: standard 3-gram, decaying cache 1-gram and decaying cache 2-gram that are joined together by means of linear interpolation using the technique of dynamic weight update. Such a model led up to 36 % and 43 % perplexity improvement with respect to the 3-gram baseline for Lithuanian words and Lithuanian word base forms resp...
Statistical language models encapsulate varied information, both grammatical and semantic, present i...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
The n-gram language model, which has its roots in statistical natural language processing, has been ...
This paper investigates a variety of statistical cache-based language models built upon three corpor...
This paper presents state of the art language modeling (LM) of Lithuanian, which is highly inflected...
This paper describes our research on statistical language modeling of Lithuanian. The idea of improv...
Abstract. This paper describes our research on statistical language modeling of Lithuanian. The idea...
In this paper we examine several combinations of classical N-gram language models with more advanced...
It is shown that the enormous improvement in the size of disk storage space in recent years can be u...
Statistical language modelling may not only be used to uncover the patterns which underlie the compo...
Language modeling is an important part for both speech recognition and machine translation systems. ...
Abstract: In this paper we present a synthesis of the theoretical fundamentals and some practical as...
Language models are probability distributions over a set of unilingual natural language text used in...
Article dans revue scientifique avec comité de lecture.In natural language and especially in spontan...
We introduce a novel approach for building language models based on a systematic, recursive explorat...
Statistical language models encapsulate varied information, both grammatical and semantic, present i...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
The n-gram language model, which has its roots in statistical natural language processing, has been ...
This paper investigates a variety of statistical cache-based language models built upon three corpor...
This paper presents state of the art language modeling (LM) of Lithuanian, which is highly inflected...
This paper describes our research on statistical language modeling of Lithuanian. The idea of improv...
Abstract. This paper describes our research on statistical language modeling of Lithuanian. The idea...
In this paper we examine several combinations of classical N-gram language models with more advanced...
It is shown that the enormous improvement in the size of disk storage space in recent years can be u...
Statistical language modelling may not only be used to uncover the patterns which underlie the compo...
Language modeling is an important part for both speech recognition and machine translation systems. ...
Abstract: In this paper we present a synthesis of the theoretical fundamentals and some practical as...
Language models are probability distributions over a set of unilingual natural language text used in...
Article dans revue scientifique avec comité de lecture.In natural language and especially in spontan...
We introduce a novel approach for building language models based on a systematic, recursive explorat...
Statistical language models encapsulate varied information, both grammatical and semantic, present i...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
The n-gram language model, which has its roots in statistical natural language processing, has been ...