Stencil computations are a widely used type of algorithm, found in applications from physical simulations to machine learning. Stencils are embarrassingly parallel and therefore map well onto modern hardware such as Graphics Processing Units. Although stencil computations have been extensively studied, optimizing them for increasingly diverse hardware remains challenging. Domain-Specific Languages (DSLs) have raised the programming abstraction and offer good performance; however, this approach places the burden on DSL implementers to write almost full-fledged parallelizing compilers and optimizers. Lift has recently emerged as a promising approach to achieve performance portability by using a small set of reusable parallel primitives that D...
Most stencil computations allow tile-wise concurrent start, i.e., there always exists a face of the ...
As the cost of data movement increasingly dominates performance, developers of finite-volume and fin...
This paper fully develops Diamond Tiling, a technique to partition the computations of stencil appli...
Stencil computations are widely used, from physical simulations to machine learning. They are embarra...
Stencil codes are a widely used class of codes. Their general structure is very simple: data points ...
This thesis studies tiling optimization techniques for stencil programs. Traditionally, res...
The key common bottleneck in most stencil codes is data movement, and prior research has shown that ...
The implementation of stencil computations on modern, massively parallel systems with GPUs and other...
Stencil computations are operations on structured grids. They are frequently found in pa...
Stencil computations are iterative kernels often used to simulate the change in a discretized spatia...
In this thesis, we introduce a new optimization theory for stencil-based applications which is cente...
Performance optimization of stencil computations has been widely studied in the literature, ...
Our aim is to apply program transformations to stencil codes to yield the highest possible per...
Code transformations, such as loop tiling and loop fusion, are of key importance for the efficient i...
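Several of the abstracts above describe stencils as iterative kernels that update each grid point from its neighbours, and name loop tiling as a key transformation. The following is a minimal sketch of both ideas, not taken from any of the listed works: a 3-point averaging stencil on a 1D grid with fixed boundaries, and the same sweep with its index range split into tiles. The function names, the averaging weights, and the tile size are assumptions chosen for illustration. The tiling here is purely spatial within one sweep; it does not implement time tiling, diamond tiling, or concurrent start.

```python
def jacobi_sweep(u):
    """One sweep of a 3-point averaging stencil; the two boundary values stay fixed."""
    n = len(u)
    out = list(u)  # write to a copy so every update reads the previous time step
    for i in range(1, n - 1):
        out[i] = (u[i - 1] + u[i] + u[i + 1]) / 3.0
    return out


def jacobi_sweep_tiled(u, tile=4):
    """The same sweep, with the interior index range split into tiles.

    Each tile reads only from `u` and writes only its own slice of `out`,
    so the tiles within one sweep are independent of each other.
    """
    n = len(u)
    out = list(u)
    for start in range(1, n - 1, tile):
        stop = min(start + tile, n - 1)
        for i in range(start, stop):
            out[i] = (u[i - 1] + u[i] + u[i + 1]) / 3.0
    return out


def run(u, steps, sweep=jacobi_sweep):
    """Apply `sweep` repeatedly, one call per time step."""
    for _ in range(steps):
        u = sweep(u)
    return u


if __name__ == "__main__":
    grid = [0.0] * 4 + [9.0] + [0.0] * 4
    print(run(grid, 3))
    print(run(grid, 3, jacobi_sweep_tiled))
```

Because both versions perform the same arithmetic per point, they produce identical results; the tiled form only changes the order in which points are visited, which is what makes it a starting point for cache blocking or for assigning tiles to parallel workers.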