Stencil computations are widely used in domains ranging from physical simulations to machine learning. They are embarrassingly parallel and fit modern hardware such as Graphics Processing Units (GPUs) well. Although stencil computations have been extensively studied, optimizing them for increasingly diverse hardware remains challenging. Domain-Specific Languages (DSLs) have raised the level of programming abstraction while offering good performance. However, this places the burden on DSL implementers, who have to write almost full-fledged parallelizing compilers and optimizers. Lift has recently emerged as a promising approach to achieving performance portability; it is based on a small set of reusable parallel primitives that DSL or library writers can build upon. Lift’s key...
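To make the term concrete, the following is a minimal sketch of a stencil computation. It is an illustration only and is not Lift code or anything taken from the paper: the function name `jacobi_1d`, the 3-point averaging neighborhood, and the clamped boundary handling are all assumptions chosen for brevity. Each output element depends only on a fixed neighborhood of the input, which is why the iterations are independent and the pattern is embarrassingly parallel.

```python
def jacobi_1d(grid):
    """Apply a 3-point averaging stencil to a 1D grid.

    Hypothetical example: boundaries are handled by clamping the
    neighbor index to the valid range (an assumption, one of several
    common boundary treatments).
    """
    n = len(grid)
    out = [0.0] * n
    for i in range(n):
        # Each iteration reads only grid[i-1], grid[i], grid[i+1]
        # and writes only out[i], so iterations do not depend on
        # each other and could run in parallel.
        left = grid[max(i - 1, 0)]
        right = grid[min(i + 1, n - 1)]
        out[i] = (left + grid[i] + right) / 3.0
    return out


result = jacobi_1d([0.0, 3.0, 6.0, 9.0])
print(result)  # [1.0, 3.0, 6.0, 8.0]
```

Writing to a separate output array, rather than updating `grid` in place, is what keeps the iterations independent in this sketch.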
Communicated by Guest Editors. The implementation of stencil computations on modern, massively parall...
In this work, we present Dido, an implicitly parallel domain-specific language (DSL) that captures h...
In this paper, we present Patus, a code generation and auto-tuning framework for stencil computation...
Stencil computations are a widely used type of algorithm, found in applications from physical simula...
Stencil codes are a widely used class of codes. Their general structure is very simple: data points ...
The implementation of stencil computations on modern, massively parallel systems with GPUs and other...
In this thesis, we introduce a new optimization theory for stencil-based applications which is cente...
Performance optimization of stencil computations has been widely studied in the literature, ...
Dissertation. Stencil computations are operations on structured grids. They are frequently found in pa...
PDE discretization schemes yielding stencil-like computing patterns are commonly used for seismic mo...
The key common bottleneck in most stencil codes is data movement, and prior research has shown that ...
A straightforward implementation of an algorithm in a general-purpose programming language does usua...
Code transformations, such as loop tiling and loop fusion, are of key importance for the efficient i...
Stencil computations are not well optimized by general-purpose production compilers and the increase...
Stencil computations are an integral component of applications in a number of scientific computing d...