Abstract. In the unstructured finite volume method, loops over mesh components such as cells, faces, and nodes are widely used to traverse data. A mesh loop results in direct or indirect data access, which significantly affects data locality. Within a mesh loop, many threads may access the same data, which leads to data dependence. Both data locality and data dependence play an important part in the performance of GPU simulations. To optimize a GPU-accelerated unstructured finite volume Computational Fluid Dynamics (CFD) program, the performance of hot spots under different loops over cells, faces, and nodes is evaluated on Nvidia Tesla V100 and K80 GPUs. Numerical tests under different mesh scales show that the effects of mesh loop modes are differ...
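To make the loop modes named in the abstract concrete, here is a minimal serial Python sketch that is not taken from the paper. It assumes a hypothetical four-cell mesh with three interior faces, and the arrays `owner`, `neighbour`, and `u` and the simple difference flux are illustrative only. It contrasts a face loop, which scatters each face flux to two cells, with a cell loop, which gathers fluxes from the faces of each cell.

```python
# Hypothetical mesh: 4 cells in a row, 3 interior faces.
# Face f separates cell owner[f] from cell neighbour[f].
owner = [0, 1, 2]
neighbour = [1, 2, 3]
u = [1.0, 4.0, 9.0, 16.0]  # illustrative cell-centred values


def flux(f):
    """Illustrative flux: difference of the two adjacent cell values."""
    return u[neighbour[f]] - u[owner[f]]


def residual_face_loop():
    """Face loop (scatter): each iteration writes to two different cells."""
    res = [0.0] * len(u)
    for f in range(len(owner)):
        phi = flux(f)
        # With one GPU thread per face, these two updates could collide
        # with updates from other faces that share the same cell.
        res[owner[f]] += phi
        res[neighbour[f]] -= phi
    return res


def build_cell_faces():
    """Cell-to-face connectivity with an orientation sign per face."""
    cell_faces = [[] for _ in u]
    for f in range(len(owner)):
        cell_faces[owner[f]].append((f, 1.0))
        cell_faces[neighbour[f]].append((f, -1.0))
    return cell_faces


def residual_cell_loop():
    """Cell loop (gather): each iteration writes only to its own cell."""
    cell_faces = build_cell_faces()
    res = [0.0] * len(u)
    for c, faces in enumerate(cell_faces):
        acc = 0.0
        for f, sign in faces:
            acc += sign * flux(f)  # every interior flux is computed twice
        res[c] = acc
    return res
```

Both functions produce the same residual on this mesh. In a GPU setting the face loop would typically need atomic updates or a face coloring to avoid write conflicts, while the cell loop avoids conflicts at the cost of redundant flux evaluations and indirect reads. Which trade-off wins is the kind of question the abstract says was measured.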
Design optimization relies heavily on time-consuming simulations, especially when using gradient-fre...
In this paper, the authors identify the scalability bottlenecks of an unstructured grid CFD code (PE...
The advent of general purpose computing on graphics cards has led to significant software speedup in...
Key words: graphics processing units, unstructured finite volume method, computational fluid dynam...
Abstract. Graphical Processing Units (GPUs) have shown acceleration factors over multicores for stru...
Unstructured-mesh based numerical algorithms such as finite volume and finite element algorithms for...
Abstract. Computational Fluid Dynamics (CFD) utilizes numerical solutions of Partial Differential Equa...
Abstract—Increasingly, the main bottleneck limiting performance on emerging multi-core and many-core...
This paper presents GPU parallelization for a computational fluid dynamics solver which works on a m...
This paper presents a number of optimisations for improving the performance of unstructured computat...
A new high-performance general-purpose graphics processing unit (GPGPU) computational fluid dynamics...
The study was undertaken as part of a larger effort to establish a common computational fluid dynami...
The continued development of improved algorithms and architecture for numerical simulations is at th...
This paper describes the GPU accelerated MBFLO2 multi-block turbulent flow solver completely in doub...
The Explicit Finite Element Method is a powerful tool in nonlinear dynamic finite element analysis. ...