Selected for presentation at the HiPEAC 2013 Conf.International audienceThis paper addresses the compilation of a sequential program for parallel execution on a modern GPU. To this end, we present a novel source-to-source compiler called PPCG. PPCG singles out for its ability to accelerate computations from any static control loop nest, generating multiple CUDA kernels when necessary. We introduce a multilevel tiling strategy and a code generation scheme for the parallelization and locality optimization of imperfectly nested loops, managing memory and exposing concurrency according to the constraints of modern GPUs. We evaluate our algorithms and tool on the entire PolyBench suite
International audienceWe propose a framework based on an original generation and use of algorithmic ...
The shift toward parallel processor architectures has made programming and code generation increasin...
AbstractGraphics processor units (GPUs) have evolved to handle throughput oriented workloads where a...
Selected for presentation at the HiPEAC 2013 Conf.International audienceThis paper addresses the com...
International audienceAutomatic parallelization is becoming more important as parallelism becomes ub...
have emerged as a powerful accelerator for general-purpose computations. GPUs are attached to every ...
state.edu GPUs are a class of specialized parallel architectures with tremen-dous computational powe...
Recent advances in multi-core and many-core processors re-quires programmers to exploit an increasin...
This thesis proposes new extensions to the code generation phase in polyhedral compilers. The main f...
Recent advances in multi-core and many-core processors requires programmers to exploit an increasing...
6 pagesInternational audienceParallel and heterogeneous computing are growing in audience thanks to ...
In this paper, we discuss techniques to transform sequential programs to texture/surface memory opt...
Abstract—Recently GPUs have risen as one important par-allel platform for general purpose applicatio...
Graphics Processing Units (GPU) have been widely adopted to accelerate the execution of HPC workload...
International audienceWe propose a framework based on an original generation and use of algorithmic ...
The shift toward parallel processor architectures has made programming and code generation increasin...
AbstractGraphics processor units (GPUs) have evolved to handle throughput oriented workloads where a...
Selected for presentation at the HiPEAC 2013 Conf.International audienceThis paper addresses the com...
International audienceAutomatic parallelization is becoming more important as parallelism becomes ub...
have emerged as a powerful accelerator for general-purpose computations. GPUs are attached to every ...
state.edu GPUs are a class of specialized parallel architectures with tremen-dous computational powe...
Recent advances in multi-core and many-core processors re-quires programmers to exploit an increasin...
This thesis proposes new extensions to the code generation phase in polyhedral compilers. The main f...
Recent advances in multi-core and many-core processors requires programmers to exploit an increasing...
6 pagesInternational audienceParallel and heterogeneous computing are growing in audience thanks to ...
In this paper, we discuss techniques to transform sequential programs to texture/surface memory opt...
Abstract—Recently GPUs have risen as one important par-allel platform for general purpose applicatio...
Graphics Processing Units (GPU) have been widely adopted to accelerate the execution of HPC workload...
International audienceWe propose a framework based on an original generation and use of algorithmic ...
The shift toward parallel processor architectures has made programming and code generation increasin...
AbstractGraphics processor units (GPUs) have evolved to handle throughput oriented workloads where a...