Part 4: Big Data+Cloud. Great efforts have been made in meta-genomics for new species exploration over the past decades. With the development of next-generation sequencing technology, meta-genomics datasets have grown to dozens or hundreds of gigabytes, or even several terabytes, which poses a severe challenge for data analysis. Moreover, conventional meta-genomics comparison algorithms may not take full advantage of the computing capacity offered by parallel computing techniques, because the algorithms themselves lack parallelism. In this paper, we propose DDP-B, a distributed dynamic parallel framework for meta-genomics binary similarity analysis, to overcome these limitations. In this framework, we introduce a binary di...
This paper presents a joint effort between a group of computer scientists and ...
Abstract: Taxonomic clustering of species is an important and frequently arising problem in metageno...
In this manuscript, we present an optimized and parallel version of our previous work IMSAME...
Background: The metagenomics method directly sequences and analyses genome information from microbial co...
Thesis (Ph.D.), School of Electrical Engineering and Computer Science, Washington State University. Th...
Computers today become more powerful through increased numbers of processors rather than clock speed...
Paper presented at the 4th International Workshop on Parallelism in Bioinformatics (Euro-Par), held...
In the field of biology, researchers need to compare genes or gene products using semantic similarit...
Metagenomics is the investigation of genetic samples directly obtained from the environment. Driven ...
Genomic datasets are growing dramatically as the cost of sequencing continues to decline and small s...
Sequence similarity searches have been widely used in the analyses of metagenomic sequencing data. F...
Generalizable approaches, models, and frameworks for irregular application scalability are an old yet...
A revolution in personalized genomics will occur when scientists can sequence genomes of millions of...
As a distributed data-parallelization (DDP) pattern, MapReduce has been adopted by many new big data...
Genome sequence comparisons of exponentially growing data sets form the foundation for the comparati...