Our work aims to explore interoperability of large-scale cloud data processing software in HPC environments, and involves clustering and 3D visualization of gene sequence collections. We completed visualizing samples of 100K – 400K fungal sequences. This research involves major supercomputer computation as both clustering and visualization steps scale, even up to the square of the sample size. Our experiments were conducted on Indiana University’s Big Red II [1] supercomputer and in collaboration with bioinformatics and biology faculty at IU. communication by transforming map-reduc
High-throughput experiments enable researchers to explore complex multifactorial diseases through la...
High-throughput experiments enable researchers to explore complex multifactorial diseases through la...
The Cancer GenomeCollaboratory is a compute cloud environment that was set up to facilitate analysis...
Ever since high-throughput DNA sequencing became economically feasible, the amount of biological dat...
Kary Ocaña,1 Daniel de Oliveira2 1National Laboratory of Scientific Computing, Petrópo...
The use of machine learning techniques, in particular unsupervised clustering and dimensionality red...
Hi-C experiments generate data in form of large genome contact maps (Hi-C maps). These show that chr...
This thesis studies exploratory cluster analysis of genomic high-throughput data sets and their inte...
The developing human brain is a complex process, governed by the human genome. Understanding this de...
Pathogen genomic data analysis can be extremely bespoke and diverse. This paper presents our plan an...
Pathogen genomic data analysis can be extremely bespoke and diverse. This paper presents our plan an...
Bioinformatics is a much updated topic for the recent researchers. There are various tasks of bioinf...
Cloud computing is often adopted to process big\ud data for genome analysis due to its elasticity an...
Modern biology is largely shaped by the development of next generation sequencing (NGS) technology. ...
The Final Year Project, Visualization and Sharing of Genomics Data via a Cloud Based System, documen...
High-throughput experiments enable researchers to explore complex multifactorial diseases through la...
High-throughput experiments enable researchers to explore complex multifactorial diseases through la...
The Cancer GenomeCollaboratory is a compute cloud environment that was set up to facilitate analysis...
Ever since high-throughput DNA sequencing became economically feasible, the amount of biological dat...
Kary Ocaña,1 Daniel de Oliveira2 1National Laboratory of Scientific Computing, Petrópo...
The use of machine learning techniques, in particular unsupervised clustering and dimensionality red...
Hi-C experiments generate data in form of large genome contact maps (Hi-C maps). These show that chr...
This thesis studies exploratory cluster analysis of genomic high-throughput data sets and their inte...
The developing human brain is a complex process, governed by the human genome. Understanding this de...
Pathogen genomic data analysis can be extremely bespoke and diverse. This paper presents our plan an...
Pathogen genomic data analysis can be extremely bespoke and diverse. This paper presents our plan an...
Bioinformatics is a much updated topic for the recent researchers. There are various tasks of bioinf...
Cloud computing is often adopted to process big\ud data for genome analysis due to its elasticity an...
Modern biology is largely shaped by the development of next generation sequencing (NGS) technology. ...
The Final Year Project, Visualization and Sharing of Genomics Data via a Cloud Based System, documen...
High-throughput experiments enable researchers to explore complex multifactorial diseases through la...
High-throughput experiments enable researchers to explore complex multifactorial diseases through la...
The Cancer GenomeCollaboratory is a compute cloud environment that was set up to facilitate analysis...