The recent release of twenty-two new genome sequences has dramatically increased the data available for mammalian comparative genomics, but twenty of these new sequences are currently limited to ∼2x cov-erage. Here we examine the extent of sequencing error in these 2x assemblies, and its potential impact in downstream analyses. By comparing 2x assemblies with high-quality sequences from the ENCODE regions, we show that sequencing error, at 1–4 errors per kilobase, is sufficiently low for many purposes, yet still can have surprising effects. For example, an apparent lineage-specific insertion in a coding region is more likely to reflect sequencing error than a true biological event, and the length distribution of coding indels is strongly di...
Funding: S.C.V. was funded by a Max Planck Research Group award from the Max Planck Society, and a H...
Motivation: Bioinformatics tools, such as assemblers and aligners, are expected to produce more accu...
Many inferences about the biological properties of an organism depend on the completeness and accura...
The recent release of twenty-two new genome sequences has dramatically increased the data available ...
The recent release of twenty-two new genome sequences has dramatically increased the data available ...
The recent release of twenty-two new genome sequences has dramatically increased the data available ...
We describe a statistical and comparative-genomic approach for quantifying error rates of genome seq...
Tremendous evolvement in sequencing technologies and the vast availability of data due to decreasing...
The study of functional genomics, particularly in non-model organisms, has been dramatically improve...
Single-molecule sequencing instruments can generate multikilobase sequences with the potential to gr...
New sequencing technology has dramatically altered the landscape of whole-genome sequencing, allowin...
Background: A feature common to all DNA sequencing technologies is the presence of base-call errors ...
Abstract Background Recently, many standalone applications have been proposed to correct sequencing ...
Single-molecule sequencing instruments can generate multikilobase sequences with the potential to gr...
Motivation: Bioinformatics tools, such as assemblers and aligners, are expected to produce more accu...
Funding: S.C.V. was funded by a Max Planck Research Group award from the Max Planck Society, and a H...
Motivation: Bioinformatics tools, such as assemblers and aligners, are expected to produce more accu...
Many inferences about the biological properties of an organism depend on the completeness and accura...
The recent release of twenty-two new genome sequences has dramatically increased the data available ...
The recent release of twenty-two new genome sequences has dramatically increased the data available ...
The recent release of twenty-two new genome sequences has dramatically increased the data available ...
We describe a statistical and comparative-genomic approach for quantifying error rates of genome seq...
Tremendous evolvement in sequencing technologies and the vast availability of data due to decreasing...
The study of functional genomics, particularly in non-model organisms, has been dramatically improve...
Single-molecule sequencing instruments can generate multikilobase sequences with the potential to gr...
New sequencing technology has dramatically altered the landscape of whole-genome sequencing, allowin...
Background: A feature common to all DNA sequencing technologies is the presence of base-call errors ...
Abstract Background Recently, many standalone applications have been proposed to correct sequencing ...
Single-molecule sequencing instruments can generate multikilobase sequences with the potential to gr...
Motivation: Bioinformatics tools, such as assemblers and aligners, are expected to produce more accu...
Funding: S.C.V. was funded by a Max Planck Research Group award from the Max Planck Society, and a H...
Motivation: Bioinformatics tools, such as assemblers and aligners, are expected to produce more accu...
Many inferences about the biological properties of an organism depend on the completeness and accura...