© 2016 Association for Computational Linguistics. In this paper we improve over the hierarchical Pitman-Yor processes language model in a cross-domain setting by adding skipgrams as features. We find that adding skipgram features reduces the perplexity. This reduction is substantial when models are trained on a generic corpus and tested on domain-specific corpora. We also find that within-domain testing and crossdomain testing require different backoff strategies. We observe a 30-40% reduction in perplexity in a cross-domain language modelling task, and up to 6% reduction in a within-domain experiment, for both English and Flemish-Dutch.Onrust L., van den Bosch A., Van hamme H., ''Improving cross-domain n-gram language modelling with skipgr...
Pelemans J., De Laet B., Van hamme H., Wambacq P., ''The effect of word similarity on N-gram languag...
Recent progress in variable n-gram language modeling provides an efficient representation of n-gram ...
This paper presents two techniques for language model (LM) adaptation. The first aims to build a mor...
Contains fulltext : 159825.pdf (publisher's version ) (Open Access)In this paper w...
We introduce a novel approach for building language models based on a systematic, recursive explorat...
Data sparsity is a large problem in natural language processing that refers to the fact that languag...
© 2014 Pelemans et al.. In this paper we examine several combinations of classical N-gram language m...
We introduce a novel approach for build-ing language models based on a system-atic, recursive explor...
Verwimp L., Pelemans J., Van hamme H., Wambacq P., ''Extending n-gram language models based on equiv...
© 2015 Lyan Verwimp, Joris Pelemans, Hugo Van hamme, Patrick Wambacq. The subject of this paper is t...
In natural languages the variability in the underlying linguistic generation rules significantly alt...
In domains with insufficient matched training data, language models are often constructed by interpo...
A language model combining word-based and category-based ngrams within a backoff framework is presen...
International audienceThis paper describes an extension of the n-gram language model: the similar n-...
We present a tutorial introduction to n-gram models for language modeling and survey the most widely...
Pelemans J., De Laet B., Van hamme H., Wambacq P., ''The effect of word similarity on N-gram languag...
Recent progress in variable n-gram language modeling provides an efficient representation of n-gram ...
This paper presents two techniques for language model (LM) adaptation. The first aims to build a mor...
Contains fulltext : 159825.pdf (publisher's version ) (Open Access)In this paper w...
We introduce a novel approach for building language models based on a systematic, recursive explorat...
Data sparsity is a large problem in natural language processing that refers to the fact that languag...
© 2014 Pelemans et al.. In this paper we examine several combinations of classical N-gram language m...
We introduce a novel approach for build-ing language models based on a system-atic, recursive explor...
Verwimp L., Pelemans J., Van hamme H., Wambacq P., ''Extending n-gram language models based on equiv...
© 2015 Lyan Verwimp, Joris Pelemans, Hugo Van hamme, Patrick Wambacq. The subject of this paper is t...
In natural languages the variability in the underlying linguistic generation rules significantly alt...
In domains with insufficient matched training data, language models are often constructed by interpo...
A language model combining word-based and category-based ngrams within a backoff framework is presen...
International audienceThis paper describes an extension of the n-gram language model: the similar n-...
We present a tutorial introduction to n-gram models for language modeling and survey the most widely...
Pelemans J., De Laet B., Van hamme H., Wambacq P., ''The effect of word similarity on N-gram languag...
Recent progress in variable n-gram language modeling provides an efficient representation of n-gram ...
This paper presents two techniques for language model (LM) adaptation. The first aims to build a mor...