Abstract. This article introduces a C++ template library dedicated at vectorizing algorithms for different target architectures: Multi-Target Parallel Skeleton (MTPS). Skeletons describing the data structures and algorithms are provided and allow MTPS to generate a code with optimized memory access patterns for the choosen architecture. MTPS currently supports x86-64 multicore CPUs and CUDA enabled GPUs. On these architectures, performances close to hardware limits are observed
This thesis addresses the challenges of developing multitarget code - that is to say, codes whose pe...
This thesis addresses the challenges of developing multitarget code - that is to say, codes whose pe...
International audienceThis paper describes a short and simple way of improving the performance of ve...
International audienceThis article introduces MTPS, a C++ template library dedicated at vectorizing ...
International audienceThis article introduces a C++ template library dedicated at vectorizing algori...
part of European Conference on Object-Oriented Programming - ECOOP 2009International audienceThis pa...
This paper discusses the design and implementation of a polymorphic collection library for distribut...
We present a new C++ library design for linear algebra computations on high performance architecture...
Tpetra is a C++ library for linear algebra computations on high-performance distributed node systems...
Due to energy constraints, high performance computing platforms are becoming increasingly heterogene...
International audienceThis article presents a framework for algorithmic skeletons that aims at repre...
How to program a parallel machine has always been a major research problem. Many tools, languages an...
Abstract. The Standard Template Adaptive Parallel Library (STAPL) is a par-allel library designed as...
GPGPUs and other accelerators are becoming a mainstream asset for high-performance computing. Raisin...
This report presents SkePU, a C++ template library which provides a simple and unified interface for...
This thesis addresses the challenges of developing multitarget code - that is to say, codes whose pe...
This thesis addresses the challenges of developing multitarget code - that is to say, codes whose pe...
International audienceThis paper describes a short and simple way of improving the performance of ve...
International audienceThis article introduces MTPS, a C++ template library dedicated at vectorizing ...
International audienceThis article introduces a C++ template library dedicated at vectorizing algori...
part of European Conference on Object-Oriented Programming - ECOOP 2009International audienceThis pa...
This paper discusses the design and implementation of a polymorphic collection library for distribut...
We present a new C++ library design for linear algebra computations on high performance architecture...
Tpetra is a C++ library for linear algebra computations on high-performance distributed node systems...
Due to energy constraints, high performance computing platforms are becoming increasingly heterogene...
International audienceThis article presents a framework for algorithmic skeletons that aims at repre...
How to program a parallel machine has always been a major research problem. Many tools, languages an...
Abstract. The Standard Template Adaptive Parallel Library (STAPL) is a par-allel library designed as...
GPGPUs and other accelerators are becoming a mainstream asset for high-performance computing. Raisin...
This report presents SkePU, a C++ template library which provides a simple and unified interface for...
This thesis addresses the challenges of developing multitarget code - that is to say, codes whose pe...
This thesis addresses the challenges of developing multitarget code - that is to say, codes whose pe...
International audienceThis paper describes a short and simple way of improving the performance of ve...