This thesis documents the analysis and optimization of a high-order finite-difference computational fluid dynamics (CFD) application (PlasComCM). Performance bottlenecks were identified using performance tools and hardware counters. The performance analysis of PlasComCM showed that the quantity of memory accesses and the lack of vectorization inhibited optimal serial performance on an x86-based CPU. Optimization techniques, including pointer dereferencing, loop transformation, and Fortran SIMD directives, were applied to the ten most time-consuming subroutines to remove obstacles to vectorization and to improve serial performance. Details of the optimization techniques are presented and their impact on performance is discussed. A 63% reduc...
Modern multicore and manycore processors exhibit multiple levels of parallelism through a wi...
Modern computational resources have solidified the use of computer modeling as an integral part of th...
This paper presents a number of optimisations for improving the performance of unstructured computat...
This thesis presents a number of optimisations used for mapping the underlying computational pattern...
Hardware trends over the last decade show increasing complexity and heterogeneity in high performanc...
Numerous advancements made in the field of computational sciences have made CFD a viable solution to...
This paper describes performance tuning experiences with a parallel CFD code to enhance its performa...
The importance of computational fluid dynamics in the industrial design process has...
Computational fluid dynamics (CFD) is the main field of computational mechanics that has historicall...
Physics-based simulation, Computational Fluid Dynamics (CFD) in particular, has substantially reshap...
University of Minnesota Ph.D. dissertation. August 2013. Major: Computer science. Advisors: Prof. Pe...
This research focuses on evaluating and enhancing the performance of an in-house, structured, 2D CFD...
We discuss the High Performance Fortran data parallel programming language as an aid to software eng...