ABSTRACT Motivation Metagenome assembly from short next-generation sequencing data is a challenging process due to its large scale and computational complexity. Clustering short reads before assembly offers a unique opportunity for parallel downstream assembly of genomes with individualized optimization. However, current read clustering methods suffer either false negative (under-clustering) or false positive (over-clustering) problems. Results Based on a previously developed scalable read clustering method on Apache Spark, SpaRC, that has very low false positives, here we extended its capability by adding a new method to further cluster small clusters. This method exploits statistics derived from multiple samples in a dataset to reduce the...
AbstractA key challenge in analyzing metagenomics data pertains to assembly of sequenced DNA fragmen...
A metagenomic sample is a set of DNA fragments, randomly extracted from multiple cells in an environ...
MOTIVATION: The microbes that live in an environment can be identified from the combined genomic mat...
Metagenome assembly from short next-generation sequencing data is a challenging process due to its l...
Metagenome assembly from short next-generation sequencing data is a challenging process due to its l...
ABSTRACT Next-generation sequencing has enabled metagenomics, the study of the genomes of microorgan...
ABSTRACT Next-generation sequencing has enabled metagenomics, the study of the genomes of microorgan...
Abstract Background The metagenomics approach allows the simultaneous sequencing of all genomes in a...
Motivation: The microbes that live in an environment can be identified from the combined genomic mat...
Motivation: The microbes that live in an environment can be identified from the combined genomic mat...
MOTIVATION: The microbes that live in an environment can be identified from the combined genomic mat...
Motivation: The microbes that live in an environment can be identified from the combined genomic mat...
Motivation: The microbes that live in an environment can be identified from the combined genomic mat...
Motivation: The microbes that live in an environment can be identified from the combined genomic mat...
Motivation: The microbes that live in an environment can be identified from the combined genomic mat...
AbstractA key challenge in analyzing metagenomics data pertains to assembly of sequenced DNA fragmen...
A metagenomic sample is a set of DNA fragments, randomly extracted from multiple cells in an environ...
MOTIVATION: The microbes that live in an environment can be identified from the combined genomic mat...
Metagenome assembly from short next-generation sequencing data is a challenging process due to its l...
Metagenome assembly from short next-generation sequencing data is a challenging process due to its l...
ABSTRACT Next-generation sequencing has enabled metagenomics, the study of the genomes of microorgan...
ABSTRACT Next-generation sequencing has enabled metagenomics, the study of the genomes of microorgan...
Abstract Background The metagenomics approach allows the simultaneous sequencing of all genomes in a...
Motivation: The microbes that live in an environment can be identified from the combined genomic mat...
Motivation: The microbes that live in an environment can be identified from the combined genomic mat...
MOTIVATION: The microbes that live in an environment can be identified from the combined genomic mat...
Motivation: The microbes that live in an environment can be identified from the combined genomic mat...
Motivation: The microbes that live in an environment can be identified from the combined genomic mat...
Motivation: The microbes that live in an environment can be identified from the combined genomic mat...
Motivation: The microbes that live in an environment can be identified from the combined genomic mat...
AbstractA key challenge in analyzing metagenomics data pertains to assembly of sequenced DNA fragmen...
A metagenomic sample is a set of DNA fragments, randomly extracted from multiple cells in an environ...
MOTIVATION: The microbes that live in an environment can be identified from the combined genomic mat...