Abstract The architecture of high performance computing systems is becoming more and more heterogeneous, as accelerators play an increasingly important role alongside traditional CPUs. Programming heterogeneous systems efficiently is a complex task, that often requires the use of specific programming environments. Programming frameworks supporting codes portable across different high performance architectures have recently appeared, but one must carefully assess the relative costs of portability versus computing efficiency, and find a reasonable tradeoff point. In this paper we address precisely this issue, using as test-bench a Lattice Boltzmann code implemented in OpenCL. We analyze its performance on several different state-of-the-art ...
Accelerators are an increasingly common option to boost performance of codes that require extensive ...
A recent trend in scientific computing is the increasingly important role of co-processors, original...
We present a software approach to hardware-oriented numerics which builds upon an augmented, previou...
AbstractThe architecture of high performance computing systems is becoming more and more heterogeneo...
Abstract The architecture of high performance computing systems is becoming more and more heterogene...
Scientific computing community has been in close connection with high performance computing (HPC), ...
Abstract In this paper we report on our early experience on porting, optimizing and benchmarking a...
In this paper we address the problem of identifying and exploiting techniques that optimize the perf...
With computer simulations real world phenomena can be analyzed in great detail. Computational fluid ...
This paper investigates the development of a molecular dynamics code that is highly portable between...
Accelerators are quickly emerging as the leading technology to further boost computing performances;...
GPUs deliver higher performance than traditional processors, offering remarkable energy efficiency, ...
Heterogeneous clusters are a widely utilized class of supercomputers assembled from different types ...
The Unified Parallel C (UPC) language from the Partitioned Global Address Space (PGAS) family unifie...
Accelerator processors allow energy-efficient computation at high performance, especially for comput...
Accelerators are an increasingly common option to boost performance of codes that require extensive ...
A recent trend in scientific computing is the increasingly important role of co-processors, original...
We present a software approach to hardware-oriented numerics which builds upon an augmented, previou...
AbstractThe architecture of high performance computing systems is becoming more and more heterogeneo...
Abstract The architecture of high performance computing systems is becoming more and more heterogene...
Scientific computing community has been in close connection with high performance computing (HPC), ...
Abstract In this paper we report on our early experience on porting, optimizing and benchmarking a...
In this paper we address the problem of identifying and exploiting techniques that optimize the perf...
With computer simulations real world phenomena can be analyzed in great detail. Computational fluid ...
This paper investigates the development of a molecular dynamics code that is highly portable between...
Accelerators are quickly emerging as the leading technology to further boost computing performances;...
GPUs deliver higher performance than traditional processors, offering remarkable energy efficiency, ...
Heterogeneous clusters are a widely utilized class of supercomputers assembled from different types ...
The Unified Parallel C (UPC) language from the Partitioned Global Address Space (PGAS) family unifie...
Accelerator processors allow energy-efficient computation at high performance, especially for comput...
Accelerators are an increasingly common option to boost performance of codes that require extensive ...
A recent trend in scientific computing is the increasingly important role of co-processors, original...
We present a software approach to hardware-oriented numerics which builds upon an augmented, previou...