This paper deals with optimizing time-iterated computations on periodic data domains. These computations are prevalent in computational sciences, particularly in partial differential equation solvers. We propose a fully automatic technique suitable for implementation in a compiler or in a domain-specific code generator for such computations. Dependence patterns on periodic data domains prevent existing algorithms from finding tiling opportunities. Our approach augments a state-of-the-art parallelization and locality-enhancing algorithm from the polyhedral framework to allow time-tiling of stencil computations on periodic domains. Experimental results on the swim SPEC CPU2000fp benchmark show a speedup of 5× and 4.2× over the highest SPEC pe...
In the framework of perfect loop nests with uniform dependences, tiling has been extensively studied...
Many computationally-intensive programs, such as those for differential equations, spatial interpola...
In the framework of fully permutable loops, tiling has been extensively studied as a source-to-sour...
Stencil computations are iterative kernels often used to simulate the change in a discretized spatia...
This paper fully develops Diamond Tiling, a technique to partition the computations of stencil appli...
Time-tiling is necessary for the efficient execution of iterative stencil computations. Classical ...
The key common bottleneck in most stencil codes is data movement, and prior research has shown that ...
Abstract—Loop tiling is a useful technique used to achieve cache optimization in scientific computat...
Abstract. This paper proposes tiling techniques based on data depen-dencies and not in code structur...
Most stencil computations allow tile-wise concurrent start, i.e., there always exists a face of the ...
Abstract Performance optimization of stencil computations has beenwidely studied in the literature, ...
Iteration space tiling is a common strategy used by parallelizing compilers and in performance tunin...
Most stencil computations allow tile-wise concurrent start, i.e., there always exists a face of the ...
Performance optimization of stencil computations has been widely studied in the literature, since th...
The authors consider the two dimensional periodic specifications: a method to specify succinctly obj...
In the framework of perfect loop nests with uniform dependences, tiling has been extensively studied...
Many computationally-intensive programs, such as those for differential equations, spatial interpola...
In the framework of fully permutable loops, tiling has been extensively studied as a source-to-sour...
Stencil computations are iterative kernels often used to simulate the change in a discretized spatia...
This paper fully develops Diamond Tiling, a technique to partition the computations of stencil appli...
Time-tiling is necessary for the efficient execution of iterative stencil computations. Classical ...
The key common bottleneck in most stencil codes is data movement, and prior research has shown that ...
Abstract—Loop tiling is a useful technique used to achieve cache optimization in scientific computat...
Abstract. This paper proposes tiling techniques based on data depen-dencies and not in code structur...
Most stencil computations allow tile-wise concurrent start, i.e., there always exists a face of the ...
Abstract Performance optimization of stencil computations has beenwidely studied in the literature, ...
Iteration space tiling is a common strategy used by parallelizing compilers and in performance tunin...
Most stencil computations allow tile-wise concurrent start, i.e., there always exists a face of the ...
Performance optimization of stencil computations has been widely studied in the literature, since th...
The authors consider the two dimensional periodic specifications: a method to specify succinctly obj...
In the framework of perfect loop nests with uniform dependences, tiling has been extensively studied...
Many computationally-intensive programs, such as those for differential equations, spatial interpola...
In the framework of fully permutable loops, tiling has been extensively studied as a source-to-sour...