Let D = {d_1, d_2, ...} be a collection of string documents of n characters in total, which are drawn from an alphabet set Sigma =[sigma] ={1,2,3,...sigma}. The top-k document retrieval problem is to maintain D as a data structure, such that when ever a query Q=(P, k) comes, we can report (the identifiers of) those k documents that are most relevant to the pattern P (of p characters). The relevance of a document d_r with respect to a pattern P is captured by score(P, d_r), which can be any function of the set of locations where P occurs in d_r. Finding the most relevant documents to the user query is the central task of any web-search engine. In the case of web-data, the documents can be demarcated along word boundaries. All the search engi...
This doctoral dissertation presents a range of results concerning efficient algorithms and data stru...
The suffix array (SA), which is an array containing the suffixes of a string sorted into lexicograph...
AbstractWe give new space/time tradeoffs for compressed indexes that answer document retrieval queri...
Given text which is a union of d documents of strings, D = d1, d2,...., dd, the emphasis of this the...
[[abstract]]Given a set D = fd1; d2; :::; dDg of D strings of total length n, our task is to report ...
This thesis is devoted to designing and analyzing efficient text indexing data structures and associ...
This thesis focuses on the design of succinct and compressed data structures for collections of stri...
Let D = {d1, d2, d3,..., dD} be a given set of D (string) docu-ments of total length n. The top-k do...
We address the problem of indexing a collectionD = {T1,T2,...TD} of D string documents of total leng...
LetDbe a collection ofDdocuments, which are strings over an alphabet of sizeσ,of total lengthn. We ...
Abstract This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
Let T be a fixed text-string of length n and P be a varying pattern-string of length |P| \u3c= n. B...
Strings play an important role in many areas of computer science. Searching pattern in a string or s...
This is the published version. Copyright 2005 Society for Industrial and Applied MathematicsThe pro...
The proliferation of online text, such as found on the World Wide Web and in online databases, motiv...
This doctoral dissertation presents a range of results concerning efficient algorithms and data stru...
The suffix array (SA), which is an array containing the suffixes of a string sorted into lexicograph...
AbstractWe give new space/time tradeoffs for compressed indexes that answer document retrieval queri...
Given text which is a union of d documents of strings, D = d1, d2,...., dd, the emphasis of this the...
[[abstract]]Given a set D = fd1; d2; :::; dDg of D strings of total length n, our task is to report ...
This thesis is devoted to designing and analyzing efficient text indexing data structures and associ...
This thesis focuses on the design of succinct and compressed data structures for collections of stri...
Let D = {d1, d2, d3,..., dD} be a given set of D (string) docu-ments of total length n. The top-k do...
We address the problem of indexing a collectionD = {T1,T2,...TD} of D string documents of total leng...
LetDbe a collection ofDdocuments, which are strings over an alphabet of sizeσ,of total lengthn. We ...
Abstract This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
Let T be a fixed text-string of length n and P be a varying pattern-string of length |P| \u3c= n. B...
Strings play an important role in many areas of computer science. Searching pattern in a string or s...
This is the published version. Copyright 2005 Society for Industrial and Applied MathematicsThe pro...
The proliferation of online text, such as found on the World Wide Web and in online databases, motiv...
This doctoral dissertation presents a range of results concerning efficient algorithms and data stru...
The suffix array (SA), which is an array containing the suffixes of a string sorted into lexicograph...
AbstractWe give new space/time tradeoffs for compressed indexes that answer document retrieval queri...