Massively parallel accelerators such as GPGPUs, manycores and FPGAs represent a powerful and affordable tool for scientists who look to speed up simulations of complex systems. However, porting code to such devices requires a detailed understanding of heterogeneous programming tools and effective strategies for parallelization. In this paper we present a source to source compilation approach with whole-program analysis to automatically transform single-threaded FORTRAN 77 legacy code into OpenCL-accelerated programs with parallelized kernels. The main contributions of our work are: (1) whole-source refactoring to allow any subroutine in the code to be offloaded to an accelerator. (2) Minimization of the data transfer between the host and...
Modern supercomputer architectures are evolving towards embedding more and more cores per compute no...
This paper reports on the development of an MPI/OpenCL implementation of LU, an application-level be...
Modern systems-on-chip augment their baseline CPU with coprocessors and accelerators to increase ove...
General-purpose GPU-based systems are highly attractive, as they give potentially massive performanc...
International audienceManycore architectures are now available in a wide range of HPC systems. Going...
There is a large body of legacy scientific code in use today that could benefit from execution on ac...
Heterogeneous multicores like GPGPUs are now commonplace in modern computing systems. Although heter...
A large number of Fortran legacy programs are still running in production environments, and most of ...
Significantly increasing intra-node parallelism is widely recognised as being a key prerequisite for...
SIMD hardware accelerators o er an alternative to manycores when energy consumption and performance ...
Initially driven by a strong need for increased computational performance in science and engineerin...
The rising pressure to simultaneously improve performance and reduce power consumption is driving mo...
In this report we present a novel approach to model coupling for shared-memory multicore systems hos...
In an ideal world, scientific applications would be expressed as high-level compositions of abstract...
GPU computing has established itself as a way to accelerate parallel codes in the high performance c...
Modern supercomputer architectures are evolving towards embedding more and more cores per compute no...
This paper reports on the development of an MPI/OpenCL implementation of LU, an application-level be...
Modern systems-on-chip augment their baseline CPU with coprocessors and accelerators to increase ove...
General-purpose GPU-based systems are highly attractive, as they give potentially massive performanc...
International audienceManycore architectures are now available in a wide range of HPC systems. Going...
There is a large body of legacy scientific code in use today that could benefit from execution on ac...
Heterogeneous multicores like GPGPUs are now commonplace in modern computing systems. Although heter...
A large number of Fortran legacy programs are still running in production environments, and most of ...
Significantly increasing intra-node parallelism is widely recognised as being a key prerequisite for...
SIMD hardware accelerators o er an alternative to manycores when energy consumption and performance ...
Initially driven by a strong need for increased computational performance in science and engineerin...
The rising pressure to simultaneously improve performance and reduce power consumption is driving mo...
In this report we present a novel approach to model coupling for shared-memory multicore systems hos...
In an ideal world, scientific applications would be expressed as high-level compositions of abstract...
GPU computing has established itself as a way to accelerate parallel codes in the high performance c...
Modern supercomputer architectures are evolving towards embedding more and more cores per compute no...
This paper reports on the development of an MPI/OpenCL implementation of LU, an application-level be...
Modern systems-on-chip augment their baseline CPU with coprocessors and accelerators to increase ove...