Parallel programming is extremely challenging. Worse yet, parallel architectures evolve quickly, and parallel programs must often be refactored for each new architecture. Performance portability is therefore highly desirable: programs developed on one architecture should deliver good performance on other architectures. This thesis is part of the AJITPar project, which investigates a novel approach to achieving performance portability: developing cost models that inform scheduling decisions with dynamic information about computational and communication costs on the target architecture. The main artifact of the AJITPar project is the Adaptive Skeleton Library (ASL), which provides a distributed-memory master-worker implem...
Languages for efficient parallel programming need to achieve high performance portability in order to...
This work has been partially supported by the EU Horizon 2020 grant “RePhrase: Refactoring Parallel ...
Abstract: Portable, efficient, parallel programming requires cost models to compare different possible...
Tracing JIT compilation generates units of compilation that are easy to analyse and are known to exe...
Tracing JIT compilation generates units of compilation that are easy to analyse and are known to ex...
The proliferation of widely available, but very different, parallel architectures makes the ability ...
The proliferation of widely available, but very different, parallel architectures makes the ability...
Execution times may be reduced by offloading parallel loop nests to a GPU. Auto-parallelizing compil...
Institute for Computing Systems Architecture. This work presents an automatic cost-analysis system for...
A new model of parallel computation is presented that provides appropriate cost models for automat...
Abstract: Languages for efficient parallel programming need to achieve high performance portability...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...