We present Neon, a new programming model for grid-based computation with an intuitive, easy-to-use interface that allows domain experts to take full advantage of single-node multi-GPU systems. Neon decouples data structure from computation and back end configurations, allowing the same user code to operate on a variety of data structures and devices. Neon relies on a set of hierarchical abstractions that allow the user to write their applications as if they were sequential applications, while the runtime handles distribution across multiple GPUs and performs optimizations such as overlapping computation and communication without user intervention. We evaluate our programming model on several applications: a Lattice Boltzmann fluid solver, a...
The use of graphical processing units (GPUs) for general purpose calculations has gained a lot of at...
We present and compare different approaches for using multiple Graphics Processing Units in the simu...
Application programming for modern heterogeneous systems which comprise multi-core CPUs and multiple...
We present Neon, a new programming model for grid-based computation with an intuitive, easy-to-use i...
We present a software approach to hardware-oriented numerics which builds upon an augmented, previou...
We present Sailfish, an open source fluid simulation package implementing the lattice Boltzmann meth...
We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluste...
The GPU programming model is primarily aimed at the development of applications that run one GPU. Ho...
Lattice-Boltzmann methods are versatile numerical modeling techniques capable of reproducing a wide ...
In this paper, we describe the implementation of a multi-GPU fluid flow solver based on the lattice ...
Over the last 20 years, the computing revolution has created many social benefits. The computing ene...
In this paper, we describe the implementation of a multi-graphical processing unit (GPU) fluid flow ...
Graphics Processing Units (GPUs) have been successfully used to accelerate scientific applications d...
Accelerators are an increasingly common option to boost performance of codes that require extensive ...
We present a software approach to hardware-oriented numerics which builds upon an augmented, previou...
The use of graphical processing units (GPUs) for general purpose calculations has gained a lot of at...
We present and compare different approaches for using multiple Graphics Processing Units in the simu...
Application programming for modern heterogeneous systems which comprise multi-core CPUs and multiple...
We present Neon, a new programming model for grid-based computation with an intuitive, easy-to-use i...
We present a software approach to hardware-oriented numerics which builds upon an augmented, previou...
We present Sailfish, an open source fluid simulation package implementing the lattice Boltzmann meth...
We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluste...
The GPU programming model is primarily aimed at the development of applications that run one GPU. Ho...
Lattice-Boltzmann methods are versatile numerical modeling techniques capable of reproducing a wide ...
In this paper, we describe the implementation of a multi-GPU fluid flow solver based on the lattice ...
Over the last 20 years, the computing revolution has created many social benefits. The computing ene...
In this paper, we describe the implementation of a multi-graphical processing unit (GPU) fluid flow ...
Graphics Processing Units (GPUs) have been successfully used to accelerate scientific applications d...
Accelerators are an increasingly common option to boost performance of codes that require extensive ...
We present a software approach to hardware-oriented numerics which builds upon an augmented, previou...
The use of graphical processing units (GPUs) for general purpose calculations has gained a lot of at...
We present and compare different approaches for using multiple Graphics Processing Units in the simu...
Application programming for modern heterogeneous systems which comprise multi-core CPUs and multiple...