Traditional approaches to language modelling have relied on a fixed corpus of text to inform the parameters of a probability distribution over word sequences. Increasing the corpus size often leads to better-performing language models, but no matter how large, the corpus is a static entity, unable to reflect information about events which postdate it. In these pages we introduce an online paradigm which interleaves the estimation and application of a language model. We present a Bayesian approach to online language modelling, in which the marginal probabilities of a static trigram model are dynamically updated to match the topic being dictated to the system. We also describe the architecture of a prototype we have implemented which uses the...
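The idea in this abstract — nudging a static trigram model's probabilities toward the topic currently being dictated — can be illustrated with a generic cache-interpolation sketch. This is not the paper's actual Bayesian update; the class name, the `lam` weight, and the stand-in probability dictionary are all hypothetical, chosen only to show the static-plus-dynamic interpolation pattern:

```python
from collections import Counter

class CacheAdaptedTrigram:
    """Minimal sketch: interpolate a static trigram model with a
    dynamic unigram cache built from recently dictated words.
    The static model here is a stand-in dict; a real system would
    use a smoothed trigram model trained on a large corpus."""

    def __init__(self, static_trigram_prob, vocab, lam=0.2):
        self.static = static_trigram_prob   # dict: (w1, w2, w3) -> prob
        self.vocab = vocab
        self.lam = lam                      # weight given to the dynamic cache
        self.cache = Counter()              # counts of recently observed words

    def observe(self, word):
        # Update the dynamic cache as dictation proceeds.
        self.cache[word] += 1

    def prob(self, w1, w2, w3):
        # Back off to a uniform estimate for unseen trigrams.
        p_static = self.static.get((w1, w2, w3), 1.0 / len(self.vocab))
        total = sum(self.cache.values())
        p_cache = self.cache[w3] / total if total else 0.0
        return (1 - self.lam) * p_static + self.lam * p_cache
```

As the user dictates on-topic words, the cache term raises the probability of words already seen in the session, which is the qualitative effect the abstract describes.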
We propose a probabilistic language model that is intended to overcome some of the limitations of th...
We propose a novel method for using the World Wide Web to acquire trigram estimates for statistical...
Training a language model from conversational speech is difficult due to the large variation of the w...
Usually, language models are built either from a closed corpus, or by using World Wide Web retrieved...
We present a new approach for language modeling based on dynamic Bayesian netw...
In a previous paper we proposed Web-based language models relying on possibility theory. These m...
We are interested in the problem of learning stochastic language models on-line (without speech tran...
The use of language is one of the defining features of human cognition. Focusing here on two key fea...
In this paper we investigate whether more accurate modeling of differences in language in different ...
Building models of language is a central task in natural language processing. Traditionally, languag...
This thesis contributes to the research domain of statistical language modeling. In this domain, the...
In (Ward and Vega 2008) we examined how word probabilities vary with time into utterance, and pr...
Most previous work on trainable language generation has focused on two paradigms: (a) using a statis...
Adaptor grammars are a flexible, powerful formalism for defining nonparametric, unsupervised models...
In recent years there has been an increased interest in domain adaptation techniques for statistical...