We present an algorithm to generate a set of highly reliable overlaps based on identifying repeat k-mers. This method does not require uniform coverage. We also describe an error correction method based on multi-sequence comparison that allows extending the usable sequence in reads trimmed to a 16% error rate. We use PhrapUMD, a version of the Phrap assembly program that uses only overlaps computed by the UMD overlapper. We integrate the University of Maryland algorithms with Baylor’s ATLAS assembler applied to Rattus norvegicus. Starting with the same data as the Nov. 2002 ATLAS assembly, we compare our results to 4.5 Mbp of rat sequence in 21 finished BACs. We find that after extension and error correction, (i) the reads...
Next-generation sequencers such as Illumina can now produce reads up to 300 bp with high throughput,...
The recent release of twenty-two new genome sequences has dramatically increased the data available ...
We present a reliable, easy to implement algorithm to generate a set of highly reliable overlaps bas...
The assembly methods used for whole-genome shotgun (WGS) data have a major impact on the quality of ...
The “original Atlas with UMD Plausible” and “original Atlas with UMD reliable” assembly results o...
The correct positions of reads A, B, C and D are shown. (b) A “fork” in the overlaps. (c) a scena...
The whole-genome shotgun (WGS) assembly technique has been remarkably successful in efforts to deter...
We describe a Sequence assembler, Reps (repeat-masked Phrap with scaffolding), that explicitly ident...
Shotgun sequencing is the most powerful strategy for large scale sequencing. Two main approaches exi...
The next generation sequencing technology creates a huge number of sequences (reads), which constitu...
The next-generation sequencing (NGS) technology outputs a huge number of sequences (reads) that requ...