International audienceStencil computations are the basis to solve many problems related to Partial Differential Equations (PDEs). Obtaining the best performance with such numerical kernels is a major issue as many critical parameters (architectural features, compiler flags, memory policies, multithreading strategies) must be finely tuned. In this context, auto-tuning methods have been extensively used to improve the overall performance. However, the complexity of current architectures and the large number of optimizations to consider reduce the efficiency of this approach. This paper focuses on the use of Machine Learning to predict the performance of stencil kernels on multi-core architectures. Low-level hardware counters (e.g. cache-misse...
Multi-core and many-core were already major trends for the past six years, and are expected to conti...
International audienceStencil computation represents an important numerical kernel in scientific com...
Abstract—The microarchitectural design space of a new processor is too large for an architect to eva...
International audienceStencil computations are the basis to solve many problems related to Partial D...
AbstractIt is crucial to optimize stencil computations since they are the core (and most computation...
We propose and evaluate a novel strategy for tuning the performance of a class of stencil computatio...
The focus of this work is the automatic performance tuning of stencil computations on Graphics Proce...
Understanding the most efficient design and utilization of emerging multicore systems is one of the ...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
Communicated by Guest Editors Our aim is to apply program transformations to stencil codes in order ...
Scientific applications often require massive amounts of compute time and power. With the constantly...
The resurgence of machine learning since the late 1990s has been enabled by significant advances in ...
This work introduces a generalized framework for automatically tuning stencil computations to achiev...
Multi-core and many-core were already major trends for the past six years and are expected to contin...
Multi-core and many-core were already major trends for the past six years and are expected to contin...
Multi-core and many-core were already major trends for the past six years, and are expected to conti...
International audienceStencil computation represents an important numerical kernel in scientific com...
Abstract—The microarchitectural design space of a new processor is too large for an architect to eva...
International audienceStencil computations are the basis to solve many problems related to Partial D...
AbstractIt is crucial to optimize stencil computations since they are the core (and most computation...
We propose and evaluate a novel strategy for tuning the performance of a class of stencil computatio...
The focus of this work is the automatic performance tuning of stencil computations on Graphics Proce...
Understanding the most efficient design and utilization of emerging multicore systems is one of the ...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
Communicated by Guest Editors Our aim is to apply program transformations to stencil codes in order ...
Scientific applications often require massive amounts of compute time and power. With the constantly...
The resurgence of machine learning since the late 1990s has been enabled by significant advances in ...
This work introduces a generalized framework for automatically tuning stencil computations to achiev...
Multi-core and many-core were already major trends for the past six years and are expected to contin...
Multi-core and many-core were already major trends for the past six years and are expected to contin...
Multi-core and many-core were already major trends for the past six years, and are expected to conti...
International audienceStencil computation represents an important numerical kernel in scientific com...
Abstract—The microarchitectural design space of a new processor is too large for an architect to eva...