Abstract—We describe a set of lower-level abstractions to improve performance on modern large scale heterogeneous sys-tems. These provide portable access to system- and hardware-dependent features, automatically apply dynamic optimizations at run time, and target stencil-based codes used in finite differenc-ing, finite volume, or block-structured adaptive mesh refinement codes. These abstractions include a novel data structure to manage refinement information for block-structured adaptive mesh re-finement, an iterator mechanism to efficiently traverse multi-dimensional arrays in stencil-based codes, and a portable API and implementation for explicit SIMD vectorization. These abstractions can either be employed manually, or be targeted by au...
Over the last decade block-structured adaptive mesh refinement (SAMR) has found increasing use in la...
Improvements in the performance of processors and networks make it both feasible and interesting to ...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
Starting from a high-level problem description in terms of partial differential equations using abst...
Starting from a high-level problem description in terms of partial differential equations using ab-s...
Starting from a high-level problem description in terms of partial differential equations using abst...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
In this paper we present research on improving the resilience of the execution of scientific softwar...
Abstract. We describe Cactus, a framework for building a variety of computing applications in scienc...
Improvements in the performance of processors and networks make it both feasible and interesting to ...
Abstract. We describe Cactus, a framework for building a variety of computing applications in scienc...
Scientific applications are critical for solving complex problems in many areas of research, and oft...
In this report we present research on applying a domain specific high-level abstractions development...
PDE discretization schemes yielding stencil-like computing patterns are commonly used for seismic mo...
As the adoption of parallel and heterogeneous systems increases, programming such systems also becom...
Over the last decade block-structured adaptive mesh refinement (SAMR) has found increasing use in la...
Improvements in the performance of processors and networks make it both feasible and interesting to ...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
Starting from a high-level problem description in terms of partial differential equations using abst...
Starting from a high-level problem description in terms of partial differential equations using ab-s...
Starting from a high-level problem description in terms of partial differential equations using abst...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
In this paper we present research on improving the resilience of the execution of scientific softwar...
Abstract. We describe Cactus, a framework for building a variety of computing applications in scienc...
Improvements in the performance of processors and networks make it both feasible and interesting to ...
Abstract. We describe Cactus, a framework for building a variety of computing applications in scienc...
Scientific applications are critical for solving complex problems in many areas of research, and oft...
In this report we present research on applying a domain specific high-level abstractions development...
PDE discretization schemes yielding stencil-like computing patterns are commonly used for seismic mo...
As the adoption of parallel and heterogeneous systems increases, programming such systems also becom...
Over the last decade block-structured adaptive mesh refinement (SAMR) has found increasing use in la...
Improvements in the performance of processors and networks make it both feasible and interesting to ...
Application codes reliably achieve performance far less than the advertised capabilities of existing...