In this work, a novel graphics processing unit (GPU) distributed sharing model for hybrid many-core architectures is introduced and employed in the acceleration of a production-level computational fluid dynamics (CFD) code. The latest generation graphics hardware allows multiple processor cores to simultaneously share a single GPU through concurrent kernel execution. This feature has allowed the NASA FUN3D code to be accelerated in parallel with up to four processor cores sharing a single GPU. For codes to scale and fully use resources on these and the next generation machines, codes will need to employ some type of GPU sharing model, as presented in this work. Findings include the effects of GPU sharing on overall performance. A discussion...
We investigate multi-level parallelism on GPU clusters with MPI-CUDA and hybrid MPI-OpenMP-CUDA para...
AbstractA parallel finite-volume CFD algorithm for modeling of incompressible flows on hybrid superc...
A parallel finite-volume CFD algorithm for modeling of incompressible flows on hybrid supercomputers...
GPU computing has established itself as a way to accelerate parallel codes in the high performance c...
AbstractIncreased levels of parallel CFD by utilizing graphical processing units (GPUs) in a high pe...
Modern graphics processing units (GPUs) with many-core architectures have emerged as general-purpose...
Modern graphics processing units (GPUs) with many-core architectures have emerged as general-purpose...
A new high-performance general-purpose graphics processing unit (GPGPU) computational fluid dynamics...
A parallelization study designed for ADI-type algorithms is presented using the OpenMP specification...
High performance computing using graphics processing units (GPUs) is gaining popularity in the scien...
Graphics processor units (GPU) that are originally designed for graphics rendering have emerged as m...
Graphics processor units (GPU) that are traditionally designed for graphics rendering have emerged a...
Graphics processing units (GPUs) have a strong floating-point capability and a high memory bandwidth...
Computational Fluid Dynamics (CFD) is an important field in high performance computing with numerous...
In this PhD thesis, we present our research in the domain of high performance software for computati...
We investigate multi-level parallelism on GPU clusters with MPI-CUDA and hybrid MPI-OpenMP-CUDA para...
AbstractA parallel finite-volume CFD algorithm for modeling of incompressible flows on hybrid superc...
A parallel finite-volume CFD algorithm for modeling of incompressible flows on hybrid supercomputers...
GPU computing has established itself as a way to accelerate parallel codes in the high performance c...
AbstractIncreased levels of parallel CFD by utilizing graphical processing units (GPUs) in a high pe...
Modern graphics processing units (GPUs) with many-core architectures have emerged as general-purpose...
Modern graphics processing units (GPUs) with many-core architectures have emerged as general-purpose...
A new high-performance general-purpose graphics processing unit (GPGPU) computational fluid dynamics...
A parallelization study designed for ADI-type algorithms is presented using the OpenMP specification...
High performance computing using graphics processing units (GPUs) is gaining popularity in the scien...
Graphics processor units (GPU) that are originally designed for graphics rendering have emerged as m...
Graphics processor units (GPU) that are traditionally designed for graphics rendering have emerged a...
Graphics processing units (GPUs) have a strong floating-point capability and a high memory bandwidth...
Computational Fluid Dynamics (CFD) is an important field in high performance computing with numerous...
In this PhD thesis, we present our research in the domain of high performance software for computati...
We investigate multi-level parallelism on GPU clusters with MPI-CUDA and hybrid MPI-OpenMP-CUDA para...
AbstractA parallel finite-volume CFD algorithm for modeling of incompressible flows on hybrid superc...
A parallel finite-volume CFD algorithm for modeling of incompressible flows on hybrid supercomputers...