Metagenomics is the investigation of genetic samples directly obtained from the environment. Driven by the rapid development of DNA sequencing technology and continuous reductions in sequencing costs, studies in metagenomics become popular over the past few years with the potential to discover novel knowledge in many fields through analysing the diversity of microbial ecology. The availability of large-scale datasets increases the challenge in data analysis, especially for hierarchical clustering that has a quadratic time complexity. This thesis presents the design and implementation of a parallelisation method for single-linkage hierarchical clustering for metagenomics data. Using 16 parallel threads, p-swarm was measured to achieve 11 tim...
Backgrounds: Recent explosion of biological data brings a great challenge for the traditional cluste...
Part 4: Big Data+CloudInternational audienceGreat efforts have been made on meta-genomics in the fie...
BACKGROUNDS: Recent explosion of biological data brings a great challenge for the traditional cluste...
Abstract—Taxonomic clustering of species is an impor-tant and frequently arising problem in metageno...
Cluster analysis or clustering is an important data mining technique widely used for pattern recogni...
The rapid advances of high-throughput sequencing technologies dramatically prompted metagenomic stud...
The rapid development of sequencing technology has led to an explosive accumulation of genomic seque...
<div><p>The rapid development of sequencing technology has led to an explosive accumulation of genom...
Abstract Background In recent years, the demand for computational power in computational biology has...
In this dissertation, we address three different problems in high-throughput metagenomics and chemin...
Background: Metagenomics method directly sequences and analyses genome information from microbial co...
Thesis (Ph.D.), Department of Electrical Engineering and Computer Science, Washington State Universi...
Currently, clustering applications use classical methods to partition a set of data (or objects) in ...
Genomic datasets are growing dramatically as the cost of sequencing continues to decline and small s...
Genomic sequences can be viewed as special types of documents. These are typically organised and sto...
Backgrounds: Recent explosion of biological data brings a great challenge for the traditional cluste...
Part 4: Big Data+CloudInternational audienceGreat efforts have been made on meta-genomics in the fie...
BACKGROUNDS: Recent explosion of biological data brings a great challenge for the traditional cluste...
Abstract—Taxonomic clustering of species is an impor-tant and frequently arising problem in metageno...
Cluster analysis or clustering is an important data mining technique widely used for pattern recogni...
The rapid advances of high-throughput sequencing technologies dramatically prompted metagenomic stud...
The rapid development of sequencing technology has led to an explosive accumulation of genomic seque...
<div><p>The rapid development of sequencing technology has led to an explosive accumulation of genom...
Abstract Background In recent years, the demand for computational power in computational biology has...
In this dissertation, we address three different problems in high-throughput metagenomics and chemin...
Background: Metagenomics method directly sequences and analyses genome information from microbial co...
Thesis (Ph.D.), Department of Electrical Engineering and Computer Science, Washington State Universi...
Currently, clustering applications use classical methods to partition a set of data (or objects) in ...
Genomic datasets are growing dramatically as the cost of sequencing continues to decline and small s...
Genomic sequences can be viewed as special types of documents. These are typically organised and sto...
Backgrounds: Recent explosion of biological data brings a great challenge for the traditional cluste...
Part 4: Big Data+CloudInternational audienceGreat efforts have been made on meta-genomics in the fie...
BACKGROUNDS: Recent explosion of biological data brings a great challenge for the traditional cluste...