Efficiently using the hardware capabilities of the Cell processor, a heterogeneous chip multiprocessor that uses several levels of parallelism to deliver high performance, and being able to reuse legacy code are real challenges for application developers. We propose to use Generative Programming, and more precisely template meta-programming, to design an Embedded Domain Specific Language that uses algorithmic skeletons to generate applications from a high-level mapping description. The method is easy for developers to use and delivers performance close to that of optimized hand-written code, as shown on various benchmarks ranging from simple BLAS kernels to image processing applications.
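As a rough illustration of the general idea, the sketch below shows how algorithmic skeletons can be encoded as C++ templates so that a mapping description becomes a type the compiler resolves statically. It is not the interface of the tool described in the abstract: the names `Pipe`, `Map`, `Scale2`, `AddOne`, `Kernel` and `run_example` are hypothetical, and the code runs sequentially on the host rather than dispatching work to the Cell processor's processing elements.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical user-level stages: each one exposes a static apply().
struct Scale2 { static constexpr float apply(float x) { return 2.0f * x; } };
struct AddOne { static constexpr float apply(float x) { return x + 1.0f; } };

// Pipeline skeleton: chains stages left to right. The recursion over the
// type list is unrolled by the compiler, so no run-time dispatch remains.
template <typename... Stages> struct Pipe;

template <> struct Pipe<> {
    static constexpr float apply(float x) { return x; }
};

template <typename First, typename... Rest>
struct Pipe<First, Rest...> {
    static constexpr float apply(float x) {
        return Pipe<Rest...>::apply(First::apply(x));
    }
};

// Map skeleton: applies a stage (or a nested skeleton) to every element.
// Assumption: a real back end would split this loop across cores.
template <typename Stage>
struct Map {
    template <std::size_t N>
    static std::array<float, N> apply(const std::array<float, N>& in) {
        std::array<float, N> out{};
        for (std::size_t i = 0; i < N; ++i) out[i] = Stage::apply(in[i]);
        return out;
    }
};

// The high-level description of the application is just a type.
using Kernel = Map<Pipe<Scale2, AddOne>>;

// The composition is checked at compile time: 2 * 1 + 1 == 3.
static_assert(Pipe<Scale2, AddOne>::apply(1.0f) == 3.0f,
              "pipeline stages compose left to right");

// Small usage example on a three-element buffer.
inline std::array<float, 3> run_example() {
    const std::array<float, 3> in{{1.0f, 2.0f, 3.0f}};
    const std::array<float, 3> out = Kernel::apply(in);
    assert(out.size() == in.size());  // Map preserves the buffer length
    return out;
}
```

Because the skeleton structure lives entirely in the type `Kernel`, changing the mapping (for example swapping the order of the stages) is a one-line edit, and the compiler can inline the whole chain into a single loop body.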
Algorithmic skeletons, introduced by Cole, were designed to ease the developme...
When developing software in Operational Research (OR), one usually aims at get...
The design and implementation of high level tools for parallel programming is ...
Special purpose embedded languages facilitate generating high-performance code from purely...
The quality of compiler-optimized code for high-performance applications lags ...
The quality of compiler-optimized code for high-performance applications is far behind what ...
The quality of compiler-optimized code for high-performance applications lags way behind what optimi...
Algorithmic skeletons can be used to write architecture independent programs, shielding application ...
This article presents a framework for algorithmic skeletons that aims at repre...
The Cell Broadband Engine processor is a powerful processor capable of over 220 GFLOPS. It is highly...