This thesis addresses important algorithms and data structures used in sequence analysis for applications such as read mapping. First, we give an overview of state-of-the-art FM indices and present the latest improvements. In particular, we introduce a recently published FM index based on a new data structure: EPR dictionaries. This rank data structure allows search steps in constant time for unidirectional and bidirectional FM indices. To our knowledge, this is the first and only constant-time implementation of a bidirectional FM index at the time of writing. We show that its running time is not only optimal in theory, but currently also outperforms all available FM index implementations in practice. Second, we cover approximate st...
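To make the notion of a "search step" concrete, the following is a minimal sketch of backward search on a unidirectional FM index. It is not the EPR dictionary from the thesis: the rank queries are answered here by a plain precomputed occurrence table, which is simple but uses far more memory than a succinct rank structure. The function names `build_fm_index` and `backward_search` and the `$` sentinel are assumptions made for this illustration.

```python
def build_fm_index(text):
    """Build a toy FM index: suffix array, C table, and occurrence table."""
    text += "$"  # sentinel, assumed to sort before every other symbol
    # Naive suffix array construction; fine for short illustrative texts only.
    sa = sorted(range(len(text)), key=lambda i: text[i:])
    # Burrows-Wheeler transform: the character preceding each sorted suffix.
    bwt = "".join(text[i - 1] for i in sa)
    alphabet = sorted(set(text))
    # C[c] = number of characters in the text strictly smaller than c.
    C, total = {}, 0
    for c in alphabet:
        C[c] = total
        total += text.count(c)
    # occ[c][i] = number of occurrences of c in bwt[:i] (a naive rank table).
    occ = {c: [0] * (len(bwt) + 1) for c in alphabet}
    for i, ch in enumerate(bwt):
        for c in alphabet:
            occ[c][i + 1] = occ[c][i] + (ch == c)
    return sa, C, occ


def backward_search(pattern, sa, C, occ):
    """Return the sorted text positions at which pattern occurs."""
    lo, hi = 0, len(sa)  # half-open suffix array interval [lo, hi)
    for ch in reversed(pattern):
        if ch not in C:
            return []
        # One search step: two rank lookups narrow the interval.
        lo = C[ch] + occ[ch][lo]
        hi = C[ch] + occ[ch][hi]
        if lo >= hi:
            return []
    return sorted(sa[lo:hi])


sa, C, occ = build_fm_index("BANANA")
print(backward_search("ANA", sa, C, occ))
```

Each loop iteration performs exactly two rank lookups, so the cost of a search step is determined by the cost of a rank query. This is why a constant-time rank structure yields constant-time search steps.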
Thesis (Ph. D.)--University of Washington, 2001. The recent explosion in the availability of long cont...
Mapping reads against a genome sequence is an interesting and useful problem i...
This dissertation proposes a novel tree structure, Error Tree (ET), to more efficiently solve the Ap...
Recent advances in sequencing technology allow the production of billions of base pairs per day in the form...
The work presented in this dissertation deals with establishing efficient methods for solving some a...
The combination of incessant advances in sequencing technology producing large amounts of data and i...
Searching for repetitive structures in DNA sequences is a major problem in bioinformatics research. ...
Recent advances in sequencing technology allow the production of billions of base pairs per day in the form...
High-throughput sequencing has helped to transform our study of biological organisms and processes. ...
In this article, we propose a novel pattern matching algorithm, called BAPM, that performs searching...
The volumes of data generated by high-throughput sequencing technologies are growing exponentially...
The objective of the research in this dissertation is to derive optimal search schemes for approxima...
In the biological sciences, sequence analysis refers to analytical investigations that use nucleic a...
The amount of data generated by Next Generation Sequencing technologies has increased exponentially in recen...
Large corpora of texts or of sequences serve as references and are interrogate...