dissertationStencil computations are operations on structured grids. They are frequently found in partial differential equation solvers, making their performance critical to a range of scientific applications. On modern architectures where data movement costs dominate computation, optimizing stencil computations is a challenging task. Typically, domain scientists must reduce and orchestrate data movement to tackle the memory bandwidth and latency bottlenecks. Furthermore, optimized code must map efficiently to ever increasing parallelism on a chip. This dissertation studies several stencils with varying arithmetic intensities, thus requiring contrasting optimization strategies. Stencils traditionally have low arithmetic intensity, making t...
We propose and evaluate a novel strategy for tuning the performance of a class of stencil computatio...
thesisAt the beginning of the 21st century, it became apparent that the performance gains associated...
The growth of data to be processed in the Oil & Gas industry matches the requirements imposed by evo...
A widely used class of codes are stencil codes. Their general structure is very simple: data points ...
The optimization of data parallel programs is a challenging open problem. We analyzed in detail the ...
As the cost of data movement increasingly dominates performance, developers of finite-volume and fin...
In this paper, we present Patus, a code generation and auto-tuning framework for stencil computation...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
Understanding the most efficient design and utilization of emerging multicore systems is one of the ...
International audienceStencil computation represents an important numerical kernel in scientific com...
Mathematicians and computational scientists are often limited in their ability to model complex phen...
AbstractIn this paper we investigate how stencil computations can be implemented on state-of-the-art...
PDE discretization schemes yielding stencil-like computing patterns are commonly used for seismic mo...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
In this thesis, we introduce a new optimization theory for stencil-based applications which is cente...
We propose and evaluate a novel strategy for tuning the performance of a class of stencil computatio...
thesisAt the beginning of the 21st century, it became apparent that the performance gains associated...
The growth of data to be processed in the Oil & Gas industry matches the requirements imposed by evo...
A widely used class of codes are stencil codes. Their general structure is very simple: data points ...
The optimization of data parallel programs is a challenging open problem. We analyzed in detail the ...
As the cost of data movement increasingly dominates performance, developers of finite-volume and fin...
In this paper, we present Patus, a code generation and auto-tuning framework for stencil computation...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
Understanding the most efficient design and utilization of emerging multicore systems is one of the ...
International audienceStencil computation represents an important numerical kernel in scientific com...
Mathematicians and computational scientists are often limited in their ability to model complex phen...
AbstractIn this paper we investigate how stencil computations can be implemented on state-of-the-art...
PDE discretization schemes yielding stencil-like computing patterns are commonly used for seismic mo...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
In this thesis, we introduce a new optimization theory for stencil-based applications which is cente...
We propose and evaluate a novel strategy for tuning the performance of a class of stencil computatio...
thesisAt the beginning of the 21st century, it became apparent that the performance gains associated...
The growth of data to be processed in the Oil & Gas industry matches the requirements imposed by evo...