In recent years, highly compact succinct text indexes developed in bioinformatics have spread to natural language processing, in particular to n-gram indexing. One line of research uses compressed suffix trees as both the text index and the language model. Compressed suffix trees have several favourable properties: they compress n-gram strings and their associated satellite data while still allowing fast access and fast computation of language model probabilities over the text. For count-based n-gram language models, and especially for low-order n-gram models, the Kneser-Ney language model has long been the de facto industry standard. Shareghi et al. showed how to utilize a compressed suffix tree to buil...
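To make the idea of a text index doubling as a count-based language model concrete, here is a minimal sketch. It uses a plain, uncompressed suffix array over word tokens as a stand-in for a compressed suffix tree, so it illustrates the queries only, not the space savings, and it is not the method of Shareghi et al. The function names (`build_suffix_array`, `ngram_range`, `count`, `distinct_left_contexts`) and the toy corpus are assumptions made for illustration. The two queries shown, the raw n-gram count and the number of distinct preceding words, are the kinds of statistics a Kneser-Ney model is built from.

```python
def build_suffix_array(tokens):
    """Start positions of all token suffixes, sorted lexicographically.

    Naive construction for illustration; real indexes use faster,
    more compact algorithms.
    """
    return sorted(range(len(tokens)), key=lambda i: tokens[i:])


def ngram_range(tokens, sa, ngram):
    """Half-open range [lo, hi) of suffix-array entries that start with `ngram`."""
    n = len(ngram)

    def first_index(strict):
        # Binary search for the first suffix whose n-token prefix is
        # >= ngram (strict=False) or > ngram (strict=True).
        lo, hi = 0, len(sa)
        while lo < hi:
            mid = (lo + hi) // 2
            prefix = tokens[sa[mid]:sa[mid] + n]
            if prefix < ngram or (strict and prefix == ngram):
                lo = mid + 1
            else:
                hi = mid
        return lo

    return first_index(False), first_index(True)


def count(tokens, sa, ngram):
    """Number of occurrences of `ngram` in the corpus."""
    lo, hi = ngram_range(tokens, sa, ngram)
    return hi - lo


def distinct_left_contexts(tokens, sa, ngram):
    """Number of distinct tokens that immediately precede `ngram`."""
    lo, hi = ngram_range(tokens, sa, ngram)
    return len({tokens[i - 1] for i in sa[lo:hi] if i > 0})


# Hypothetical toy corpus, tokenized on whitespace.
corpus = "the cat sat on the mat the cat ran".split()
sa = build_suffix_array(corpus)
print(count(corpus, sa, ["the", "cat"]))
print(distinct_left_contexts(corpus, sa, ["the"]))
```

Every occurrence of an n-gram is a prefix of some suffix, so all occurrences occupy one contiguous range of the sorted suffixes. A suffix tree exposes the same range as a single node, which is why counts and context statistics can be read off the index without storing each n-gram explicitly.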
The suffix tree is a fundamental data structure in stringology, but its space usage, though linear, ...
Suffix trees are among the most important data structures in stringology, with a number of a...
Efficient methods for storing and querying are critical for scaling high-order m-gram language model...
Efficient methods for storing and querying language models are critical for scaling to large corpora...
Two fundamental problems concern the handling of large n-gram language models: indexing, that is, co...
This paper deals with the two fundamental problems concerning the handling of large n-gram language ...
Suffix trees are one of the most versatile data structures in stringology, with many applications in...
Suffix trees are one of the most important data structures in stringology, with myriads of applicati...
The suffix tree is an extremely important data structure in bioinformatics. Classical implementation...