AbstractAiming to understand how high-performance CUDA programming can be done for NVIDIA's new Kepler architecture, we have investigated a specific case of simulating sediment transport. The arisen stencil computations have distinct features connected to the two nonlinear partial differential equations that constitute the mathematical model. Consequently, the required CUDA programming effort differs for the two corresponding CUDA kernel functions. While Kepler's new read-only data cache brings enough benefits for one kernel function, performance of the other kernel function is further enhanceable through using the shared memory and so-called halo threads. The highest achieved performance of the stencil computation amounts to 190.45 GFLOPs ...
In this thesis we look at a performance bottleneck of running molecular dynamics code on GPGPU devic...
Graphics Processing Units (GPUs) are quickly becoming viable alternatives for certain simulations cu...
AbstractWe optimized Moving Particle Simulation (MPS) method for Kepler GPU. Solving sparse matrix o...
Aiming to understand how high-performance CUDA programming can be done for NVIDIA's new Kepler archi...
We describe our experience using NVIDIA's CUDA (Compute Unified Device Architecture) C programming e...
As hydrological data becomes more in-depth and is measured at higher resolutions, the need for a fas...
ROMS is software that models and simulates an ocean region using a finite difference grid and time s...
The most commonly used approach for solving reaction–diffusion systems relies upon stencil computati...
In recent years, along with the higher GPU’s computational speed and memory bandwidth compared to th...
An existing hybrid MPI-OpenMP scheme is augmented with a CUDA-based fine grain parallelization appro...
AbstractThe increasing computing power of graphics processing units (GPU) has motivated the use of G...
Since the first version of CUDA was launch, many improvements were made in GPU computing. Every new ...
Graphics processor units (GPU) that are originally designed for graphics rendering have emerged as m...
Restricted solid on solid surface growth models can be mapped onto binary lattice gases. We show tha...
Graphics processor units (GPU) that are traditionally designed for graphics rendering have emerged a...
In this thesis we look at a performance bottleneck of running molecular dynamics code on GPGPU devic...
Graphics Processing Units (GPUs) are quickly becoming viable alternatives for certain simulations cu...
AbstractWe optimized Moving Particle Simulation (MPS) method for Kepler GPU. Solving sparse matrix o...
Aiming to understand how high-performance CUDA programming can be done for NVIDIA's new Kepler archi...
We describe our experience using NVIDIA's CUDA (Compute Unified Device Architecture) C programming e...
As hydrological data becomes more in-depth and is measured at higher resolutions, the need for a fas...
ROMS is software that models and simulates an ocean region using a finite difference grid and time s...
The most commonly used approach for solving reaction–diffusion systems relies upon stencil computati...
In recent years, along with the higher GPU’s computational speed and memory bandwidth compared to th...
An existing hybrid MPI-OpenMP scheme is augmented with a CUDA-based fine grain parallelization appro...
AbstractThe increasing computing power of graphics processing units (GPU) has motivated the use of G...
Since the first version of CUDA was launch, many improvements were made in GPU computing. Every new ...
Graphics processor units (GPU) that are originally designed for graphics rendering have emerged as m...
Restricted solid on solid surface growth models can be mapped onto binary lattice gases. We show tha...
Graphics processor units (GPU) that are traditionally designed for graphics rendering have emerged a...
In this thesis we look at a performance bottleneck of running molecular dynamics code on GPGPU devic...
Graphics Processing Units (GPUs) are quickly becoming viable alternatives for certain simulations cu...
AbstractWe optimized Moving Particle Simulation (MPS) method for Kepler GPU. Solving sparse matrix o...