Abstract. The advent of next generation sequencing technology has generated massive amounts of biological data at unprecen-dented rates. Comparative genomics applications often require compute-intensive tools for subsequent analysis of high throughput data. Although cloud computing infrastructure plays an important role in this respect, the pressure from such computationally expensive tasks can be further alleviated using efficient data partitioning and workflow fusion. Here, we implement a workflow-based model for parallelizing the data-intensive tasks of genome alignment and variant calling with BWA and GATK’s HaplotypeCaller. We explore three different approaches of partitioning data, granularity-based, individual-based, and alignment-ba...
The development of Next Generation Sequencing (NGS) technology resulted the rapid accumulation of a ...
Abstract Background While next-generation sequencing (NGS) costs have fallen in recent years, the co...
Genomic sequence alignment is an important technique to decode genome sequences in bioinformatics. N...
Trabajo presentado al 4th International Workshop on Parallelism in Bioinformatics (euro-Par), celebr...
With the cost of sequencing a human genome dropping below $1,000, population-scale sequencing has be...
The impending advent of population-scaled sequencing cohorts involving tens of millions of individua...
Advances in whole-genome sequencing have greatly reduced the cost and time of obtaining raw genetic ...
Genomic datasets are growing dramatically as the cost of sequencing continues to decline and small s...
In this paper, we explore the benefits of automatically determining the degree of parallelism used t...
A revolution in personalized genomics will occur when scientists can sequence genomes of millions of...
Bioinformatics research is frequently performed using complex workflows with multiple steps, fans, m...
Whole Genome Sequencing (WGS) provides information for each base of the entire 3.2 billion base pair...
The analysis of next-generation sequencing (NGS) data requires complex computational workf...
Background: While next-generation sequencing (NGS) costs have plummeted in recent years, cost and co...
The continuous increase in sequencing throughput imposes a new generation of tools for data processi...
The development of Next Generation Sequencing (NGS) technology resulted the rapid accumulation of a ...
Abstract Background While next-generation sequencing (NGS) costs have fallen in recent years, the co...
Genomic sequence alignment is an important technique to decode genome sequences in bioinformatics. N...
Trabajo presentado al 4th International Workshop on Parallelism in Bioinformatics (euro-Par), celebr...
With the cost of sequencing a human genome dropping below $1,000, population-scale sequencing has be...
The impending advent of population-scaled sequencing cohorts involving tens of millions of individua...
Advances in whole-genome sequencing have greatly reduced the cost and time of obtaining raw genetic ...
Genomic datasets are growing dramatically as the cost of sequencing continues to decline and small s...
In this paper, we explore the benefits of automatically determining the degree of parallelism used t...
A revolution in personalized genomics will occur when scientists can sequence genomes of millions of...
Bioinformatics research is frequently performed using complex workflows with multiple steps, fans, m...
Whole Genome Sequencing (WGS) provides information for each base of the entire 3.2 billion base pair...
The analysis of next-generation sequencing (NGS) data requires complex computational workf...
Background: While next-generation sequencing (NGS) costs have plummeted in recent years, cost and co...
The continuous increase in sequencing throughput imposes a new generation of tools for data processi...
The development of Next Generation Sequencing (NGS) technology resulted the rapid accumulation of a ...
Abstract Background While next-generation sequencing (NGS) costs have fallen in recent years, the co...
Genomic sequence alignment is an important technique to decode genome sequences in bioinformatics. N...