In this paper, the authors identify the scalability bottlenecks of an unstructured grid CFD code (PETSc-FUN3D) by studying the impact of several algorithmic and architectural parameters and by examining different programming models. The authors discuss the basic performance characteristics of this PDE code with the help of simple performance models developed in their earlier work, presenting primarily experimental results. In addition to achieving good per-processor performance (which has been addressed in the cited work and without which scalability claims are suspect) they strive to improve the implementation and convergence scalability of PETSc-FUN3D on thousands of processors
Abstract. Graphical Processing Units (GPUs) have shown acceleration factors over multicores for stru...
This paper highlights a three-year project by an interdisciplinary team on a legacy F77 computationa...
A computational Fluid Dynamics (CFD) code for steady simulations solves a set of non-linear partial ...
Prize winning PETSc-FUN3D aerodynamics code, extending it with highly-tuned shared-memory paralleliz...
This paper highlights a three-year project by an interdisciplinary team on a legacy F77 computationa...
The performance of scientic computing applications often achieves a small fraction of peak performan...
Abstract. With teraflops-scale computational modeling expected to be routine by 2003-04, under the t...
In order to run CFD codes more efficiently on large scales, the parallel computing has to be employe...
L'évolution constante ainsi que la complexification qui s'en suit des architectures matérielles obli...
L'évolution constante ainsi que la complexification qui s'en suit des architectures matérielles obli...
Since 2004, supercomputer growth hasbeen constrained by energy efficiency rather than raw hardware s...
Physics-based simulation, Computational Fluid Dynamics (CFD) in particular, has substantially reshap...
Physics-based simulation, Computational Fluid Dynamics (CFD) in particular, has substantially reshap...
This dissertation studies the sources of poor performance in scientific computing codes based on par...
This dissertation studies the sources of poor performance in scientific computing codes based on par...
Abstract. Graphical Processing Units (GPUs) have shown acceleration factors over multicores for stru...
This paper highlights a three-year project by an interdisciplinary team on a legacy F77 computationa...
A computational Fluid Dynamics (CFD) code for steady simulations solves a set of non-linear partial ...
Prize winning PETSc-FUN3D aerodynamics code, extending it with highly-tuned shared-memory paralleliz...
This paper highlights a three-year project by an interdisciplinary team on a legacy F77 computationa...
The performance of scientic computing applications often achieves a small fraction of peak performan...
Abstract. With teraflops-scale computational modeling expected to be routine by 2003-04, under the t...
In order to run CFD codes more efficiently on large scales, the parallel computing has to be employe...
L'évolution constante ainsi que la complexification qui s'en suit des architectures matérielles obli...
L'évolution constante ainsi que la complexification qui s'en suit des architectures matérielles obli...
Since 2004, supercomputer growth hasbeen constrained by energy efficiency rather than raw hardware s...
Physics-based simulation, Computational Fluid Dynamics (CFD) in particular, has substantially reshap...
Physics-based simulation, Computational Fluid Dynamics (CFD) in particular, has substantially reshap...
This dissertation studies the sources of poor performance in scientific computing codes based on par...
This dissertation studies the sources of poor performance in scientific computing codes based on par...
Abstract. Graphical Processing Units (GPUs) have shown acceleration factors over multicores for stru...
This paper highlights a three-year project by an interdisciplinary team on a legacy F77 computationa...
A computational Fluid Dynamics (CFD) code for steady simulations solves a set of non-linear partial ...