Abstract. Many de novo assembly tools have been created these last few years to assemble short reads generated by high throughput sequencing platforms. The core of almost all these assemblers is a sequence graph data structure that links reads together. This motivates our work: BlastGraph, a new algorithm performing intensive approx-imate string matching between a set of query sequences and a sequence graph. Our approach is similar to blast-like algorithms and additionally presents specificity due to the matching on the graph data structure. Our results show that BlastGraph perfor-mances permit its usage on large graphs in reasonable time. We propose a Cytoscape plug-in for visualizing results as well as a command line program. These progra...
The Pattern Matching in Labeled Graphs (PMLG) problem consists in finding a match for a string that ...
The past decade has witnessed the emergence of massive graph data. Graph is an important data struct...
Both the amount of data available and the rate at which it is acquired increases rapidly. The underl...
International audienceAbstract. Many de novo assembly tools have been created these last few years t...
Motivation: The de Bruijn graph is a simple and efficient data structure that is used in many areas ...
This thesis presents an application of a generalized suffix tree extended by the use of frequency of...
Motivation: New sequencing technologies generate larger amount of short reads data at decreasing cos...
Graphs are a powerful data structure that can be applied to several problems in bioinformatics. Grap...
Inexact graph matching algorithms have proved to be useful in many applications, such as character r...
International audienceDNA sequencing technologies have tremendously increased their throughput, and ...
International audienceSuffix trees belong to the most studied indexing data structures for strings. ...
We study the design of efficient algorithms for combinatorial pattern matching. More concretely, we ...
Abstract. Motivated by the imminent growth of massive, highly redun-dant genomic databases we study ...
AbstractDNA sequencing technologies have tremendously increased their throughput, and hence complica...
Part 1: Algorithms, Scheduling, Analysis, and Data MiningInternational audienceMassively parallel DN...
The Pattern Matching in Labeled Graphs (PMLG) problem consists in finding a match for a string that ...
The past decade has witnessed the emergence of massive graph data. Graph is an important data struct...
Both the amount of data available and the rate at which it is acquired increases rapidly. The underl...
International audienceAbstract. Many de novo assembly tools have been created these last few years t...
Motivation: The de Bruijn graph is a simple and efficient data structure that is used in many areas ...
This thesis presents an application of a generalized suffix tree extended by the use of frequency of...
Motivation: New sequencing technologies generate larger amount of short reads data at decreasing cos...
Graphs are a powerful data structure that can be applied to several problems in bioinformatics. Grap...
Inexact graph matching algorithms have proved to be useful in many applications, such as character r...
International audienceDNA sequencing technologies have tremendously increased their throughput, and ...
International audienceSuffix trees belong to the most studied indexing data structures for strings. ...
We study the design of efficient algorithms for combinatorial pattern matching. More concretely, we ...
Abstract. Motivated by the imminent growth of massive, highly redun-dant genomic databases we study ...
AbstractDNA sequencing technologies have tremendously increased their throughput, and hence complica...
Part 1: Algorithms, Scheduling, Analysis, and Data MiningInternational audienceMassively parallel DN...
The Pattern Matching in Labeled Graphs (PMLG) problem consists in finding a match for a string that ...
The past decade has witnessed the emergence of massive graph data. Graph is an important data struct...
Both the amount of data available and the rate at which it is acquired increases rapidly. The underl...