In a sequencing experiment, we can only “read” small fragments (reads) of DNA due to physical limitations, whose location on the genome is unknown. De novo assembly aims to put them together to retrieve the full DNA sequence, like a jigsaw puzzle. The OLC approach computes pairwise Overlaps between reads to find their Layout, and then derive a Consensus sequence. The layout can be cast as an instance of the Seriation combinatorial problem, seeking to reorder a set of elements based on their pairwise similarity, such that similar elements are nearby. In a noiseless setting, a spectral method can solve Seriation efficiently. Still, it often fails on noisy, real DNA data. Notably, assembly is challenged by repeated genomic regions (repeats) ca...
DNA barcodes carry coarse-grained genetic information of DNA sequences taken from a genome. Potentia...
Abstract Background Many organisms, in particular bacteria, contain repetitive DNA fragments called ...
The output from whole genome sequencing is a set of contigs, i.e. short non-overlapping DNA sequence...
Les technologies de séquençage d’ADN ne permettent de lire que de courts fragments, dont on ignore l...
The seriation problem seeks to reorder a set of elements given pairwise similarity information, so t...
In this thesis we discuss computational methods to deal with DNA sequences provided byhigh throughpu...
Novel approaches for the exploitation of high throughput sequencing data In this thesis we discuss c...
In this thesis, we discuss computational methods (theoretical models and algorithms) to perform the ...
High throughput DNA sequencing technologies are fuelling an accelerating trend to assemble genomes d...
Sequencing is a process in biology that determines the order of nucleotides in the DNA. It produces ...
In the last decade, sequencing technology has progressed rapidly, leading to much faster and cheaper...
Hristo Djidjev : collaborateur d'équipe associée HipcoGenNational audienceDNA molecular in silico an...
Since a few years, molecular genomics has had to deal with newsituations. First, the amount of data ...
The development of the next-generation sequencing methods has allowed the generation of vast amounts...
Obtaining DNA sequences is a time-consuming task, which typically requires one or several days for c...
DNA barcodes carry coarse-grained genetic information of DNA sequences taken from a genome. Potentia...
Abstract Background Many organisms, in particular bacteria, contain repetitive DNA fragments called ...
The output from whole genome sequencing is a set of contigs, i.e. short non-overlapping DNA sequence...
Les technologies de séquençage d’ADN ne permettent de lire que de courts fragments, dont on ignore l...
The seriation problem seeks to reorder a set of elements given pairwise similarity information, so t...
In this thesis we discuss computational methods to deal with DNA sequences provided byhigh throughpu...
Novel approaches for the exploitation of high throughput sequencing data In this thesis we discuss c...
In this thesis, we discuss computational methods (theoretical models and algorithms) to perform the ...
High throughput DNA sequencing technologies are fuelling an accelerating trend to assemble genomes d...
Sequencing is a process in biology that determines the order of nucleotides in the DNA. It produces ...
In the last decade, sequencing technology has progressed rapidly, leading to much faster and cheaper...
Hristo Djidjev : collaborateur d'équipe associée HipcoGenNational audienceDNA molecular in silico an...
Since a few years, molecular genomics has had to deal with newsituations. First, the amount of data ...
The development of the next-generation sequencing methods has allowed the generation of vast amounts...
Obtaining DNA sequences is a time-consuming task, which typically requires one or several days for c...
DNA barcodes carry coarse-grained genetic information of DNA sequences taken from a genome. Potentia...
Abstract Background Many organisms, in particular bacteria, contain repetitive DNA fragments called ...
The output from whole genome sequencing is a set of contigs, i.e. short non-overlapping DNA sequence...