Full version of a paper accepted for presentation at the 31st Annual Symposium on Combinatorial Pattern Matching (CPM 2020)We study a document retrieval problem in the new framework where $D$ text documents are organized in a {\em category tree} with a pre-defined number $h$ of categories. This situation occurs e.g. with taxomonic trees in biology or subject classification systems for scientific literature. Given a string pattern $p$ and a category (level in the category tree), we wish to efficiently retrieve the $t$ \emph{categorical units} containing this pattern and belonging to the category. We propose several efficient solutions for this problem. One of them uses $n(\log\sigma(1+o(1))+\log D+O(h)) + O(\Delta)$ bits of space and $O(|p|+...
Let D={T1,T2,…,TD} be a collection of D documents having n characters in total. Given two patterns P...
In computational linguistics, large tree databases tagged with morpho-syntactic information are in n...
Let T be a string with n characters over an alphabet of constant size. A recent breakthrough on comp...
International audienceWe study a document retrieval problem in the new framework where D text docume...
Let D = {d1, d2, d3,..., dD} be a given set of D (string) docu-ments of total length n. The top-k do...
LetDbe a collection ofDdocuments, which are strings over an alphabet of sizeσ,of total lengthn. We ...
[[abstract]]In the document retrieval problem [9], we are given a collection of documents (strings) ...
We address the problem of indexing a collectionD = {T1,T2,...TD} of D string documents of total leng...
Document retrieval is one of the best established information retrieval activities since the sixties...
Artículo de publicación ISIDocument retrieval is one of the best-established information retrieval a...
[[abstract]]Given a set D = fd1; d2; :::; dDg of D strings of total length n, our task is to report ...
This paper describes a new categorization learning algorithm, called Categorization In A Hierarchica...
The aim of this work is to show the novel index structure based suffix array and ternary search tree...
Document archives contain large amounts of data to which sophisticated queries are applied. The size...
Let D = {d_1, d_2, ...} be a collection of string documents of n characters in total, which are draw...
Let D={T1,T2,…,TD} be a collection of D documents having n characters in total. Given two patterns P...
In computational linguistics, large tree databases tagged with morpho-syntactic information are in n...
Let T be a string with n characters over an alphabet of constant size. A recent breakthrough on comp...
International audienceWe study a document retrieval problem in the new framework where D text docume...
Let D = {d1, d2, d3,..., dD} be a given set of D (string) docu-ments of total length n. The top-k do...
LetDbe a collection ofDdocuments, which are strings over an alphabet of sizeσ,of total lengthn. We ...
[[abstract]]In the document retrieval problem [9], we are given a collection of documents (strings) ...
We address the problem of indexing a collectionD = {T1,T2,...TD} of D string documents of total leng...
Document retrieval is one of the best established information retrieval activities since the sixties...
Artículo de publicación ISIDocument retrieval is one of the best-established information retrieval a...
[[abstract]]Given a set D = fd1; d2; :::; dDg of D strings of total length n, our task is to report ...
This paper describes a new categorization learning algorithm, called Categorization In A Hierarchica...
The aim of this work is to show the novel index structure based suffix array and ternary search tree...
Document archives contain large amounts of data to which sophisticated queries are applied. The size...
Let D = {d_1, d_2, ...} be a collection of string documents of n characters in total, which are draw...
Let D={T1,T2,…,TD} be a collection of D documents having n characters in total. Given two patterns P...
In computational linguistics, large tree databases tagged with morpho-syntactic information are in n...
Let T be a string with n characters over an alphabet of constant size. A recent breakthrough on comp...