The presence of duplicates introduced by PCR amplification is a major issue in paired short reads from next-generation sequencing platforms. These duplicates might have a serious impact on research applications, such as scaffolding in whole-genome sequencing and discovering large-scale genome variations, and are usually removed. We present FastUniq as a fast de novo tool for removal of duplicates in paired short reads. FastUniq identifies duplicates by comparing sequences between read pairs and does not require complete genome sequences as prerequisites. FastUniq is capable of simultaneously handling reads with different lengths and results in highly efficient running time, which increases linearly at an average speed of 87 million reads pe...
Removing duplicate and near-duplicate reads, generated by high-throughput sequencing technologies, i...
International audienceLinked reads technologies, such as the 10X chromium system, use microfluidics ...
AbstractNext-generation sequencing platforms generate short (50–150bp) reads that can be mapped onto...
The presence of duplicates introduced by PCR amplification is a major issue in paired short reads fr...
<div><p>The presence of duplicates introduced by PCR amplification is a major issue in paired short ...
Background: The first step of virtually all next generation sequencing analysis involves the splitti...
The increasingly widespread use of next generation sequencing protocols has brought the need for the...
This is a pre-copyedited, author-produced version of an article accepted for publication in Bioinfor...
Abstract Background Next generation sequencing datasets are stored as FASTQ formatted files. In orde...
BackgroundRNA sequencing (RNA-seq) has become the standard means of analyzing gene and transcript ex...
<p>(A) The number of read pairs before and after duplicates removal using FastUniq or the mapping-ba...
Background: During library construction polymerase chain reaction is used to enrich the DNA before s...
Several methods for ultra-high throughput DNA sequencing are currently under investigation. Many of ...
DNA sequencing analysis typically involves mapping reads to just one reference genome. Mapping again...
Background: RNA-seq and small RNA-seq are powerful, quantitative tools to study gene regulation and ...
Removing duplicate and near-duplicate reads, generated by high-throughput sequencing technologies, i...
International audienceLinked reads technologies, such as the 10X chromium system, use microfluidics ...
AbstractNext-generation sequencing platforms generate short (50–150bp) reads that can be mapped onto...
The presence of duplicates introduced by PCR amplification is a major issue in paired short reads fr...
<div><p>The presence of duplicates introduced by PCR amplification is a major issue in paired short ...
Background: The first step of virtually all next generation sequencing analysis involves the splitti...
The increasingly widespread use of next generation sequencing protocols has brought the need for the...
This is a pre-copyedited, author-produced version of an article accepted for publication in Bioinfor...
Abstract Background Next generation sequencing datasets are stored as FASTQ formatted files. In orde...
BackgroundRNA sequencing (RNA-seq) has become the standard means of analyzing gene and transcript ex...
<p>(A) The number of read pairs before and after duplicates removal using FastUniq or the mapping-ba...
Background: During library construction polymerase chain reaction is used to enrich the DNA before s...
Several methods for ultra-high throughput DNA sequencing are currently under investigation. Many of ...
DNA sequencing analysis typically involves mapping reads to just one reference genome. Mapping again...
Background: RNA-seq and small RNA-seq are powerful, quantitative tools to study gene regulation and ...
Removing duplicate and near-duplicate reads, generated by high-throughput sequencing technologies, i...
International audienceLinked reads technologies, such as the 10X chromium system, use microfluidics ...
AbstractNext-generation sequencing platforms generate short (50–150bp) reads that can be mapped onto...