In this paper we describe the parallelization of the multi-zone versions of the NAS Parallel Benchmarks using multi-level OpenMP parallelism. For our study, we use the NanosCompiler, which supports nesting of OpenMP directives and provides clauses to control the grouping of threads, load balancing, and synchronization. We report the benchmark results, compare the timings with those of different hybrid parallelization paradigms (MPI+OpenMP and MLP), and discuss OpenMP implementation issues that affect the performance of multi-level parallel applications.
Peer Reviewed
The Computational Fluid Dynamics (CFD) solver TAU for unstructured grids is widely used in the Europ...
MPI is the predominant model for parallel programming in technical high performance computing. With ...
This paper describes the support provided by the NanosCompiler to nested parallelism in OpenMP. The ...
This report describes a two level parallelization of a Computational Fluid Dynamic (CFD) solver with...
OpenFOAM, an open source industrial Computational Fluid Dynamics (CFD) tool, which contains dozens o...
Two alternative dual-level parallel implementations of the Multiblock Grid Princeton Ocean Model (MG...
In this paper we describe the extension of the CAPO parallelization support tool to support multilev...
This paper aims to report on the open multi-processing (OpenMP) parallel implementation of a fully u...
This white paper describes parallelization of the Slilab code with OpenMP for a shared-memory execut...
In this paper we describe the main components of the NanosCompiler, an OpenMP compiler whose impleme...
Recent developments at the NASA AMES Research Center's NAS Division have demonstrated that the new g...
Today most systems in high-performance computing (HPC) feature a hierarchical hardware design: share...