his paper reports on the successful implementation of a massively parallel GPU-accelerated algorithm for the direct numerical simulation of turbulent mixing at high Schmidt number. The work stems from a recent development (Comput. Phys. Commun., vol. 219, 2017, 313-328), in which a low-communication algorithm was shown to attain high degrees of scalability on the Cray XE6 architecture when overlapping communication and computation via dedicated communication threads. An even higher level of performance has now been achieved using OpenMP 4.5 on the Cray XK7 architecture, where on each node the 16 integer cores of an AMD Interlagos processor share a single Nvidia K20X GPU accelerator. In the new algorithm, data movements are minimized by perf...
A serial source code for simulating a supersonic ejector flow is accelerated using parallelization b...
In this report we present a novel approach to model coupling for shared-memory multicore systems hos...
A large direct numerical simulation database spanning a wide range of Reynolds and Schmidt number is...
This paper reports on the successful implementation of a massively parallel GPU-accelerated algorith...
A new dual-communicator algorithm with very favorable performance characteristics has been developed...
An application was previously developed to simulate mixing chamber problems. So it is possible to pr...
Turbulent incompressible flows play an important role in a broad range of natural and industrial pro...
This paper describes the GPU accelerated MBFLO2 multi-block turbulent flow solver completely in doub...
We introduce algorithmic advancements designed to expedite simulations in OpenFOAM using GPUs. These...
Design optimization relies heavily on time-consuming simulations, especially when using gradient-fre...
OpenACC is a directive-based programing standard aim to provide a highly portable programming model ...
There is a growing need for ever more accurate climate and weather simulations to be delivered in sh...
A fast and economical solver, accelerated by the Graphics Process Units (GPU) of a single graphics c...
This paper presents GPU parallelization for a computational fluid dynamics solver which works on a m...
We investigate multi-level parallelism on GPU clusters with MPI-CUDA and hybrid MPI-OpenMP-CUDA para...
A serial source code for simulating a supersonic ejector flow is accelerated using parallelization b...
In this report we present a novel approach to model coupling for shared-memory multicore systems hos...
A large direct numerical simulation database spanning a wide range of Reynolds and Schmidt number is...
This paper reports on the successful implementation of a massively parallel GPU-accelerated algorith...
A new dual-communicator algorithm with very favorable performance characteristics has been developed...
An application was previously developed to simulate mixing chamber problems. So it is possible to pr...
Turbulent incompressible flows play an important role in a broad range of natural and industrial pro...
This paper describes the GPU accelerated MBFLO2 multi-block turbulent flow solver completely in doub...
We introduce algorithmic advancements designed to expedite simulations in OpenFOAM using GPUs. These...
Design optimization relies heavily on time-consuming simulations, especially when using gradient-fre...
OpenACC is a directive-based programing standard aim to provide a highly portable programming model ...
There is a growing need for ever more accurate climate and weather simulations to be delivered in sh...
A fast and economical solver, accelerated by the Graphics Process Units (GPU) of a single graphics c...
This paper presents GPU parallelization for a computational fluid dynamics solver which works on a m...
We investigate multi-level parallelism on GPU clusters with MPI-CUDA and hybrid MPI-OpenMP-CUDA para...
A serial source code for simulating a supersonic ejector flow is accelerated using parallelization b...
In this report we present a novel approach to model coupling for shared-memory multicore systems hos...
A large direct numerical simulation database spanning a wide range of Reynolds and Schmidt number is...