The minimizers sampling mechanism is a popular approach to string sampling. However, it lacks good guarantees on the expected sample size across different combinations of its input parameters, and indexes built over minimizers samples lack good worst-case guarantees for on-line pattern searches. In response, we propose bidirectional string anchors (bd-anchors), a new string sampling mechanism. Given an integer ℓ, our mechanism selects the lexicographically smallest rotation in every length-ℓ fragment. We show that, like minimizers samples, bd-anchors samples are approximately uniform, locally consistent, and computable in linear time. Furthermore, our experiments demonstrate that the bd-a...
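The selection rule described above (take the lexicographically smallest rotation of every length-ℓ fragment) can be illustrated with a minimal brute-force sketch. This is not the linear-time algorithm the abstract refers to: it compares all rotations of every window directly. The function names `smallest_rotation_start` and `bd_anchors`, the 0-based positions, and the leftmost tie-breaking rule are assumptions made for illustration only.

```python
def smallest_rotation_start(window: str) -> int:
    """Return the leftmost offset at which the lexicographically
    smallest rotation of `window` begins (naive comparison)."""
    n = len(window)
    doubled = window + window  # every rotation is a length-n slice of this
    best = 0
    for i in range(1, n):
        # Strict '<' keeps the leftmost offset when rotations tie.
        if doubled[i:i + n] < doubled[best:best + n]:
            best = i
    return best


def bd_anchors(text: str, ell: int) -> list[int]:
    """Collect, over all length-`ell` windows of `text`, the 0-based text
    position where each window's smallest rotation starts.
    Brute force, for illustration only; not a linear-time method."""
    if ell <= 0:
        raise ValueError("ell must be a positive integer")
    anchors = set()
    for start in range(len(text) - ell + 1):
        window = text[start:start + ell]
        anchors.add(start + smallest_rotation_start(window))
    return sorted(anchors)


if __name__ == "__main__":
    print(bd_anchors("aabaaabcbda", 5))  # [3, 4, 5, 10]
```

In this example the first four windows all share the anchor at position 3, which shows how overlapping fragments tend to agree on the same sampled position and keep the sample sparse.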
Exact string matching is a problem that computer programmers face on a regular basis, and...
We propose a text indexing technique for approximate pattern matching, which is practical and especi...
The collection indexing problem is defined as follows: Given a collection of highly similar strings,...
The minimizers sampling mechanism is a popular mechanism for string sampling introduced independentl...
We introduce a novel alphabet sampling technique for speeding up both online and indexed str...
We study strategies of approximate pattern matching that exploit bidirectional text indexes, extend...
Finding approximate occurrences of a pattern in a text using a full-text index is a central problem ...
In many real-world database systems, a large fraction of the data is represented by strings: Sequenc...
A compressed full-text self-index for a text T is a data structure requiring reduced space and able ...
This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
This paper revisits the problem of indexing a text S[1..n] to support searching substrings in S that...
The proliferation of online text, such as on the World Wide Web and in databases, motivates the need...