Starting from a high-level problem description in terms of partial differential equations using ab-stract tensor notation, the Chemora framework dis-cretizes, optimizes, and generates complete high performance codes for a wide range of compute architectures. Chemora extends the capabilities of Cactus, facilitating the usage of large-scale CPU/GPU systems in an efficient manner for com-plex applications, without low-level code tuning. Chemora achieves parallelism through MPI and multi-threading, combining OpenMP and CUDA. Optimizations include high-level code transforma-tions, efficient loop traversal strategies, dynam-ically selected data and instruction cache usage strategies, and JIT compilation of GPU code tai-lored to the problem charac...
2012-04-27The shift to many-core architecture design paradigm in computer market has provided unprec...
We present Cholla (Computational Hydrodynamics On ParaLLel Architectures), a new three-dimensional h...
During the first decade of the twenty-first century, the advent of multicore processing reached its ...
Starting from a high-level problem description in terms of partial differential equations using ab-s...
Starting from a high-level problem description in terms of partial differential equations using abst...
Starting from a high-level problem description in terms of partial differential equations using abst...
Modern HPC architectures consist of heterogeneous multicore, many-node systems with deep memory hier...
In an ideal world, scientific applications would be expressed as high-level compositions of abstract...
The paper presents a new hydrodynamical code, OMPEGAS, for the 3D simulation of astrophysical flows ...
Abstract—We describe a set of lower-level abstractions to improve performance on modern large scale ...
Since the beginning of the field of high performance computing (HPC) after World War II, there has b...
In this work we propose two different parallel versions of the software package COMPSYN, devoted to ...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
We propose a two-stage optimization strategy to accelerate non-equilibrium ionization (NEI) calculat...
This thesis presents methods to run scientific code safely on a global-scale desktop grid. Current a...
2012-04-27The shift to many-core architecture design paradigm in computer market has provided unprec...
We present Cholla (Computational Hydrodynamics On ParaLLel Architectures), a new three-dimensional h...
During the first decade of the twenty-first century, the advent of multicore processing reached its ...
Starting from a high-level problem description in terms of partial differential equations using ab-s...
Starting from a high-level problem description in terms of partial differential equations using abst...
Starting from a high-level problem description in terms of partial differential equations using abst...
Modern HPC architectures consist of heterogeneous multicore, many-node systems with deep memory hier...
In an ideal world, scientific applications would be expressed as high-level compositions of abstract...
The paper presents a new hydrodynamical code, OMPEGAS, for the 3D simulation of astrophysical flows ...
Abstract—We describe a set of lower-level abstractions to improve performance on modern large scale ...
Since the beginning of the field of high performance computing (HPC) after World War II, there has b...
In this work we propose two different parallel versions of the software package COMPSYN, devoted to ...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
We propose a two-stage optimization strategy to accelerate non-equilibrium ionization (NEI) calculat...
This thesis presents methods to run scientific code safely on a global-scale desktop grid. Current a...
2012-04-27The shift to many-core architecture design paradigm in computer market has provided unprec...
We present Cholla (Computational Hydrodynamics On ParaLLel Architectures), a new three-dimensional h...
During the first decade of the twenty-first century, the advent of multicore processing reached its ...