This work examines a data-intensive irregular application from genomics that represents a type of Generalized N-Body problems, one of the "seven giants"of the NRC Big Data motifs. In this problem, computations (genome alignments) are performed on sparse data-dependent pairs of inputs, with variable cost computation and variable datum sizes. Unlike simulation-based N-Body problems, there is no inherent locality in the pairwise interactions, and the interaction sparsity depends on particular parameters of the input, which can also affect the quality of the output. We build-on a pre-existing bulk-synchronous implementation, using collective communication in MPI, and implement a new asynchronous one, using cross-node RPCs in UPC++. We establish...
A revolution in personalized genomics will occur when scientists can sequence genomes of millions of...
International audienceThe formalisms for describing algorithms have traditionally followed technolog...
Sequence alignment is an important tool for describing the relationships between DNA sequences. Many...
This work examines a data-intensive irregular application from genomics that represents a type of Ge...
This work examines a data-intensive irregular application from genomics, a long-read to long-read al...
Generalizable approaches, models, and frameworks for irregular application scalability is an old yet...
In many scientific and engineering applications, one has to solve not one but a sequence of instance...
Genomic datasets are growing dramatically as the cost of sequencing continues to decline and small s...
Abstract—Modern genotyping technologies are able to obtain up to a few million genetic markers (such...
Generalized linear mixed-effects models in the context of genome-wide association studies (GWAS) rep...
The impending advent of population-scaled sequencing cohorts involving tens of millions of individua...
Comparative genomics is essentially a form of data mining in large collections of n-ary relations be...
Trabajo presentado al 4th International Workshop on Parallelism in Bioinformatics (euro-Par), celebr...
The revolution in next-generation DNA sequencing technologies is leading to explosive data growth in...
Thesis (Ph.D.), School of Electrical Engineering and Computer Science, Washington State UniversityAs...
A revolution in personalized genomics will occur when scientists can sequence genomes of millions of...
International audienceThe formalisms for describing algorithms have traditionally followed technolog...
Sequence alignment is an important tool for describing the relationships between DNA sequences. Many...
This work examines a data-intensive irregular application from genomics that represents a type of Ge...
This work examines a data-intensive irregular application from genomics, a long-read to long-read al...
Generalizable approaches, models, and frameworks for irregular application scalability is an old yet...
In many scientific and engineering applications, one has to solve not one but a sequence of instance...
Genomic datasets are growing dramatically as the cost of sequencing continues to decline and small s...
Abstract—Modern genotyping technologies are able to obtain up to a few million genetic markers (such...
Generalized linear mixed-effects models in the context of genome-wide association studies (GWAS) rep...
The impending advent of population-scaled sequencing cohorts involving tens of millions of individua...
Comparative genomics is essentially a form of data mining in large collections of n-ary relations be...
Trabajo presentado al 4th International Workshop on Parallelism in Bioinformatics (euro-Par), celebr...
The revolution in next-generation DNA sequencing technologies is leading to explosive data growth in...
Thesis (Ph.D.), School of Electrical Engineering and Computer Science, Washington State UniversityAs...
A revolution in personalized genomics will occur when scientists can sequence genomes of millions of...
International audienceThe formalisms for describing algorithms have traditionally followed technolog...
Sequence alignment is an important tool for describing the relationships between DNA sequences. Many...