The next-generation of supercomputers will feature a diverse mix of accelerator devices. The increase of heterogeneity is explained by the nature of these devices - certain accelerators offer acceleration, or a shorter time to completion, for particular programs. Characteristics of these programs are fixed regardless of which accelerator is used for computation; for instance, a graph traversal program always exhibits the properties of graph traversal regardless of what device it is executed. This work presents a methodology to collect these characteristics and use them to inform the selection of optimal accelerator device. On HPC systems a single node may feature a GPU, CPU, and an FPGA or MIC. The focus of this work is to schedule scientif...
Many emerging applications require hardware acceleration due to their growing computational intensit...
Despite the fact that GPU was originally intended to be as a co-processor specializing in graphics r...
In recent years the designs of High Performance Computing (HPC) clusters have become more complex. T...
Measuring performance-critical characteristics of application workloads is important both for develo...
Heterogeneous platforms are mixes of different processing units in a compute node (e.g., CPUs+GPUs, ...
A desired trend within high energy physics is to increase particle accelerator luminosities, leading...
During the past decade, accelerators, such as NVIDIA CUDA GPUs and Intel Xeon Phis, have seen an inc...
Heterogeneous computing machines consisting of a CPU and one or more GPUs are increasingly being use...
With the ever-increasing amount of data and input variations, portable performance is becoming harde...
Data analysis is a rising field of interest for computer science research due to the growing amount ...
To help shrink the programmability-performance efficiency gap, we discuss that adaptive runtime syst...
The OpenCL standard allows targeting a large variety of CPU, GPU and accelerator architectures using...
Modern high-performance computers engage a variety of computing devices. Underutilization and oversu...
This thesis is composed of two parts, that relate to both parallel and heterogeneous processing. Th...
In recent years the designs of High Performance Computing (HPC) clusters have become more complex. T...
Many emerging applications require hardware acceleration due to their growing computational intensit...
Despite the fact that GPU was originally intended to be as a co-processor specializing in graphics r...
In recent years the designs of High Performance Computing (HPC) clusters have become more complex. T...
Measuring performance-critical characteristics of application workloads is important both for develo...
Heterogeneous platforms are mixes of different processing units in a compute node (e.g., CPUs+GPUs, ...
A desired trend within high energy physics is to increase particle accelerator luminosities, leading...
During the past decade, accelerators, such as NVIDIA CUDA GPUs and Intel Xeon Phis, have seen an inc...
Heterogeneous computing machines consisting of a CPU and one or more GPUs are increasingly being use...
With the ever-increasing amount of data and input variations, portable performance is becoming harde...
Data analysis is a rising field of interest for computer science research due to the growing amount ...
To help shrink the programmability-performance efficiency gap, we discuss that adaptive runtime syst...
The OpenCL standard allows targeting a large variety of CPU, GPU and accelerator architectures using...
Modern high-performance computers engage a variety of computing devices. Underutilization and oversu...
This thesis is composed of two parts, that relate to both parallel and heterogeneous processing. Th...
In recent years the designs of High Performance Computing (HPC) clusters have become more complex. T...
Many emerging applications require hardware acceleration due to their growing computational intensit...
Despite the fact that GPU was originally intended to be as a co-processor specializing in graphics r...
In recent years the designs of High Performance Computing (HPC) clusters have become more complex. T...