In this paper, we explore the benefits of automatically determining the degree of parallelism used to perform genetic mutation calling in a hybrid cloud environment. We propose algorithms to automatically control both the hiring of hybrid cloud resources and the selection of the degree of parallelism employed in analysis tasks executed against that cloud. Using the Broad Institute's Genome Analysis Toolkit as a case study, we then conduct profile-driven simulation studies to characterise the circumstances in which our algorithms are beneficial or deleterious compared to simple, conventional baseline algorithms. We find that there are a wide range of cloud workload scenarios where our algorithms outperform the baselines, and thereby argue th...
The combination of the Hadoop MapReduce programming model and cloud computing allows biological scie...
Computational biology applications typically favor a local, cluster-based, integrated computational ...
Abstract. The advent of next generation sequencing technology has generated massive amounts of biolo...
In this paper, we explore the benefits of automatically determining the degree of parallelism used t...
Cloud computing is often adopted to process big\ud data for genome analysis due to its elasticity an...
Next-generation sequencing (NGS) technologies have made it possible to rapidly sequence the human ge...
Background Comparative genomics resources, such as ortholog detection tools and repositories are rap...
A major bottleneck in biological discovery is now emerging at the computational level. Cloud computi...
<div><p>A major bottleneck in biological discovery is now emerging at the computational level. Cloud...
Kary Ocaña,1 Daniel de Oliveira2 1National Laboratory of Scientific Computing, Petrópo...
Cloud computing is often adopted to process big data for genome analysis due to its elasticity and p...
A major bottleneck in biological discovery is now emerging at the computational level. Cloud computi...
With the rapidly growing demand for DNA analysis, the need for storing and processing large-scale ge...
Population scale sequencing of whole human genomes is becoming economically feasible; however, data ...
<div><p>Population scale sequencing of whole human genomes is becoming economically feasible; howeve...
The combination of the Hadoop MapReduce programming model and cloud computing allows biological scie...
Computational biology applications typically favor a local, cluster-based, integrated computational ...
Abstract. The advent of next generation sequencing technology has generated massive amounts of biolo...
In this paper, we explore the benefits of automatically determining the degree of parallelism used t...
Cloud computing is often adopted to process big\ud data for genome analysis due to its elasticity an...
Next-generation sequencing (NGS) technologies have made it possible to rapidly sequence the human ge...
Background Comparative genomics resources, such as ortholog detection tools and repositories are rap...
A major bottleneck in biological discovery is now emerging at the computational level. Cloud computi...
<div><p>A major bottleneck in biological discovery is now emerging at the computational level. Cloud...
Kary Ocaña,1 Daniel de Oliveira2 1National Laboratory of Scientific Computing, Petrópo...
Cloud computing is often adopted to process big data for genome analysis due to its elasticity and p...
A major bottleneck in biological discovery is now emerging at the computational level. Cloud computi...
With the rapidly growing demand for DNA analysis, the need for storing and processing large-scale ge...
Population scale sequencing of whole human genomes is becoming economically feasible; however, data ...
<div><p>Population scale sequencing of whole human genomes is becoming economically feasible; howeve...
The combination of the Hadoop MapReduce programming model and cloud computing allows biological scie...
Computational biology applications typically favor a local, cluster-based, integrated computational ...
Abstract. The advent of next generation sequencing technology has generated massive amounts of biolo...