AbstractThe architecture of high performance computing systems is becoming more and more heterogeneous, as accelerators play an increasingly important role alongside traditional CPUs. Programming heterogeneous systems efficiently is a complex task, that often requires the use of specific programming environments. Programming frameworks supporting codes portable across different high performance architectures have recently appeared, but one must carefully assess the relative costs of portability versus computing efficiency, and find a reasonable tradeoff point. In this paper we address precisely this issue, using as test-bench a Lattice Boltzmann code implemented in OpenCL. We analyze its performance on several different state-of-the-art pro...
The Unified Parallel C (UPC) language from the Partitioned Global Address Space (PGAS) family unifie...
Accelerator processors allow energy-efficient computation at high performance, especially for comput...
Accelerators are an increasingly common option to boost performance of codes that require extensive ...
Abstract The architecture of high performance computing systems is becoming more and more heteroge...
AbstractThe architecture of high performance computing systems is becoming more and more heterogeneo...
Abstract In this paper we report on our early experience on porting, optimizing and benchmarking a...
Scientific computing community has been in close connection with high performance computing (HPC), ...
In this paper we address the problem of identifying and exploiting techniques that optimize the perf...
This paper investigates the development of a molecular dynamics code that is highly portable between...
Accelerators are quickly emerging as the leading technology to further boost computing performances;...
A recent trend in scientific computing is the increasingly important role of co-processors, original...
With computer simulations real world phenomena can be analyzed in great detail. Computational fluid ...
AbstractWe present different kernels based on Lattice-Boltzmann methods for the solution of the two-...
This paper describes a massively parallel code for a state-of-the art thermal lattice–Boltzmann meth...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
The Unified Parallel C (UPC) language from the Partitioned Global Address Space (PGAS) family unifie...
Accelerator processors allow energy-efficient computation at high performance, especially for comput...
Accelerators are an increasingly common option to boost performance of codes that require extensive ...
Abstract The architecture of high performance computing systems is becoming more and more heteroge...
AbstractThe architecture of high performance computing systems is becoming more and more heterogeneo...
Abstract In this paper we report on our early experience on porting, optimizing and benchmarking a...
Scientific computing community has been in close connection with high performance computing (HPC), ...
In this paper we address the problem of identifying and exploiting techniques that optimize the perf...
This paper investigates the development of a molecular dynamics code that is highly portable between...
Accelerators are quickly emerging as the leading technology to further boost computing performances;...
A recent trend in scientific computing is the increasingly important role of co-processors, original...
With computer simulations real world phenomena can be analyzed in great detail. Computational fluid ...
AbstractWe present different kernels based on Lattice-Boltzmann methods for the solution of the two-...
This paper describes a massively parallel code for a state-of-the art thermal lattice–Boltzmann meth...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
The Unified Parallel C (UPC) language from the Partitioned Global Address Space (PGAS) family unifie...
Accelerator processors allow energy-efficient computation at high performance, especially for comput...
Accelerators are an increasingly common option to boost performance of codes that require extensive ...