Statistical language modeling remains a challenging task, in particular for morphologically rich languages. Recently, new approaches based on factored language models have been developed to address this problem. These models provide principled ways of including additional conditioning variables other than the preceding words, such as morphological or syntactic features. However, the number of possible choices for model parameters creates a large space of models that cannot be searched exhaustively. This paper presents an entirely data-driven model selection procedure based on genetic search, which is shown to outperform both knowledge-based and random selection procedures on two different language modeling tasks (Arabic and Turkish).
In recent years neural language models (LMs) have set state-of-the-art performance for several bench...
This paper presents some very prelimi-nary results for and problems in develop-ing a statistical mac...
A language model (LM) is a probability distribution over all possible word sequences. It is a vital ...
Currently, N-gram models are the most common and widely used models for statistical language modelin...
Currently, N-gram models are the most common and widely used models for statistical language modelin...
This chapter presents an overview of language modeling followed by a discussion of the challenges in...
AbstractThis paper explores the use of linguistic information for the selection of data to train lan...
Every normal child who is exposed to linguistic input in an interactional environment acquires the c...
Language modeling is a difficult problem for languages with rich morphology. In this paper we invest...
Language modeling is critical and indispensable for many natural language ap-plications such as auto...
We propose a Finite State Machine framework for Arabic Language Modeling. The framework provides sev...
Language modeling is an important part for both speech recognition and machine translation systems. ...
In recent years neural language models (LMs) have set state-of-the-art performance for several bench...
This paper presents some very preliminary results for and problems in developing a statistical machi...
Neural architectures are prominent in the construction of language models (LMs). However, word-leve...
In recent years neural language models (LMs) have set state-of-the-art performance for several bench...
This paper presents some very prelimi-nary results for and problems in develop-ing a statistical mac...
A language model (LM) is a probability distribution over all possible word sequences. It is a vital ...
Currently, N-gram models are the most common and widely used models for statistical language modelin...
Currently, N-gram models are the most common and widely used models for statistical language modelin...
This chapter presents an overview of language modeling followed by a discussion of the challenges in...
AbstractThis paper explores the use of linguistic information for the selection of data to train lan...
Every normal child who is exposed to linguistic input in an interactional environment acquires the c...
Language modeling is a difficult problem for languages with rich morphology. In this paper we invest...
Language modeling is critical and indispensable for many natural language ap-plications such as auto...
We propose a Finite State Machine framework for Arabic Language Modeling. The framework provides sev...
Language modeling is an important part for both speech recognition and machine translation systems. ...
In recent years neural language models (LMs) have set state-of-the-art performance for several bench...
This paper presents some very preliminary results for and problems in developing a statistical machi...
Neural architectures are prominent in the construction of language models (LMs). However, word-leve...
In recent years neural language models (LMs) have set state-of-the-art performance for several bench...
This paper presents some very prelimi-nary results for and problems in develop-ing a statistical mac...
A language model (LM) is a probability distribution over all possible word sequences. It is a vital ...