MergedTrie: Efficient textual indexing

Ferrández, Antonio
Peral, Jesús

Open PDF

Open link

Publication date

April 2019

DOI

10.1371/journal.pone.0215288

Publisher

Public Library of Science (PLoS)

Journal

PLoS ONE

Abstract

The accessing and processing of textual information (i.e. the storing and querying of a set of strings) is especially important for many current applications (e.g. information retrieval and social networks), especially when working in the fields of Big Data or IoT, which require the handling of very large string dictionaries. Typical data structures for textual indexing are Hash Tables and some variants of Tries such as the Double Trie (DT). In this paper, we propose an extension of the DT that we have called MergedTrie. It improves the DT compression by merging both Tries into a single and by segmenting the indexed term into two fixed length parts in order to balance the new Trie. Thus, a higher overlapping of both prefixes and suffixes is...

Extracted data

We use cookies to provide a better user experience.

Data Protection

MergedTrie: Efficient textual indexing

Abstract

Extracted data

MergedTrie: Efficient textual indexing

Abstract

Extracted data

Related items

Related items