We focus on implementing and optimizing a sixth-order finite-difference solver for simulating compressible fluids on a GPU using third-order Runge-Kutta integration. Since graphics processing units perform well in data-parallel tasks, this makes them an attractive platform for fluid simulation. However, high-order stencil computation is memory-intensive with respect to both main memory and the caches of the GPU. We present two approaches for simulating compressible fluids using 55-point and 19-point stencils. We seek to reduce the requirements for memory bandwidth and cache size in our methods by using cache blocking and decomposing a latency-bound kernel into several bandwidth-bound kernels. Our fastest implementation is bandwidth-bound an...
The aim of the present paper is to report on our recent results for GPU accelerated simulations of c...
Power and energy consumption are becoming an increas-ing concern in high performance computing. Comp...
The continued development of improved algorithms and architecture for numerical simulations is at th...
We focus on implementing and optimizing a sixth-order finite-difference solver for simulating compre...
In this paper, the performance of the Cyclic Reduction (CR) algorithm for solving tridiagonal system...
Graphics processor units (GPU) that are traditionally designed for graphics rendering have emerged a...
Graphics processor units (GPU) that are originally designed for graphics rendering have emerged as m...
This paper introduces open-source computational fluid dynamics software named open computational flu...
In this thesis, a high-order incompressible Navier-Stokes solver is developed in the Python-based P...
The paper presents the results of the implementation of computational algorithms of hydrodynamics fo...
Computational fluid dynamics has seen a surge of popularity as a tool for visual effects animators o...
We present a parallel framework for simulating incompressible fluids with predictive-corrective inco...
We present a GPU-accelerated solver for the partitioned solution of fluid-structure interaction (FSI...
This paper presents a Graphics Processing Unit (GPU) acceleration of an iteration-based discrete vel...
The path to exascale computational fluid dynamics requires novel and disruptive hardware architectur...
The aim of the present paper is to report on our recent results for GPU accelerated simulations of c...
Power and energy consumption are becoming an increas-ing concern in high performance computing. Comp...
The continued development of improved algorithms and architecture for numerical simulations is at th...
We focus on implementing and optimizing a sixth-order finite-difference solver for simulating compre...
In this paper, the performance of the Cyclic Reduction (CR) algorithm for solving tridiagonal system...
Graphics processor units (GPU) that are traditionally designed for graphics rendering have emerged a...
Graphics processor units (GPU) that are originally designed for graphics rendering have emerged as m...
This paper introduces open-source computational fluid dynamics software named open computational flu...
In this thesis, a high-order incompressible Navier-Stokes solver is developed in the Python-based P...
The paper presents the results of the implementation of computational algorithms of hydrodynamics fo...
Computational fluid dynamics has seen a surge of popularity as a tool for visual effects animators o...
We present a parallel framework for simulating incompressible fluids with predictive-corrective inco...
We present a GPU-accelerated solver for the partitioned solution of fluid-structure interaction (FSI...
This paper presents a Graphics Processing Unit (GPU) acceleration of an iteration-based discrete vel...
The path to exascale computational fluid dynamics requires novel and disruptive hardware architectur...
The aim of the present paper is to report on our recent results for GPU accelerated simulations of c...
Power and energy consumption are becoming an increas-ing concern in high performance computing. Comp...
The continued development of improved algorithms and architecture for numerical simulations is at th...