We consider the problem of encoding a string of length n from an integer alphabet of size so access, substring equality, and Longest Common Extension (LCE) queries can be answered efficiently. We describe a new space-optimal data structure supporting logarithmic-time queries. Access and substring equality query times can furthermore be improved to the optimal O(1) if O(log n) additional precomputed words are allowed in the total space. Additionally, we provide in-place algorithms for converting between the string and our data structure. Using this new string representation, we obtain the first in-place subquadratic algorithms for several string-processing problems in the restore model: The input string is rewritable and must be restored bef...
This thesis is devoted to designing and analyzing efficient text indexing data structures and associ...
AbstractWe study the position restricted substring searching (PRSS) problem, where the task is to in...
WOS: 000347602000043We revisit the problem of finding shortest unique substring (SUS) proposed recen...
We consider the problem of encoding a string of length n from an integer alphabet of size so access,...
Suffix arrays encode the lexicographical order of all suffixes of a text and are often combined with...
In this work we present efficient algorithms for constructing sparse suffix trees, sparse suffix arr...
Abstract. We consider the problem of constructing a sparse suffix tree (or suffix array) for b suffi...
Abstract This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
International audienceIn the classic longest common substring (LCS) problem, we are given two string...
We design two compressed data structures for the full-text indexing problem that support efficient s...
In the classic longest common substring (LCS) problem, we are given two strings S and T, each of len...
Abstract. We prove that longest common prefix (LCP) information can be stored in much less space tha...
The proliferation of online text, such as found on the World Wide Web and in online databases, motiv...
Given a string S of n symbols, a longest common extension query LCE(i,j) asks for the length of the ...
We propose a new algorithmic framework that solves frequency-related data mining queries on database...
This thesis is devoted to designing and analyzing efficient text indexing data structures and associ...
AbstractWe study the position restricted substring searching (PRSS) problem, where the task is to in...
WOS: 000347602000043We revisit the problem of finding shortest unique substring (SUS) proposed recen...
We consider the problem of encoding a string of length n from an integer alphabet of size so access,...
Suffix arrays encode the lexicographical order of all suffixes of a text and are often combined with...
In this work we present efficient algorithms for constructing sparse suffix trees, sparse suffix arr...
Abstract. We consider the problem of constructing a sparse suffix tree (or suffix array) for b suffi...
Abstract This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
International audienceIn the classic longest common substring (LCS) problem, we are given two string...
We design two compressed data structures for the full-text indexing problem that support efficient s...
In the classic longest common substring (LCS) problem, we are given two strings S and T, each of len...
Abstract. We prove that longest common prefix (LCP) information can be stored in much less space tha...
The proliferation of online text, such as found on the World Wide Web and in online databases, motiv...
Given a string S of n symbols, a longest common extension query LCE(i,j) asks for the length of the ...
We propose a new algorithmic framework that solves frequency-related data mining queries on database...
This thesis is devoted to designing and analyzing efficient text indexing data structures and associ...
AbstractWe study the position restricted substring searching (PRSS) problem, where the task is to in...
WOS: 000347602000043We revisit the problem of finding shortest unique substring (SUS) proposed recen...