This paper presents a benchmarking, performance analysis and optimisation study of the OP2 “active ” library, which provides an abstraction framework for the parallel execution of unstructured mesh applications. OP2 aims to decouple the scientific specification of the application from its parallel implementation, and thereby achieve code longevity and near-optimal performance through re-targeting the application to execute on different multi-core/many-core hardware. Runtime performance results are presented for a representative unstructured mesh application on a variety of many-core processor systems, including traditional X86 architectures from Intel (Xeon based on the older Penryn and current Nehalem micro-architectures) and GPU offerings...
Many-Task Computing (MTC) is a common scenario for multiple parallel systems, such as cluster, grids...
OpenCL (Open Computing Language) is a heterogeneous programming framework for developing application...
Recent developments in processor architecture have settled a shift from sequential processing to par...
This paper presents a performance analysis and benchmark-ing study of the OP2 “active ” library, whi...
OP2 is an "active " library framework for the solution of unstructured mesh-based applicat...
OP2 is an “active ” library framework for the solution of unstructured mesh applications. It aims to...
OP2 is an "active" library framework for the solution of unstructured mesh applications. It aims to ...
Hydra is a full-scale industrial CFD application used for the design of turbomachinery at Rolls Royc...
Applications based on unstructured meshes are typically compute intensive, leading to long running t...
Hydra is a full-scale industrial CFD application used for the design of turbomachinery at Rolls Royc...
Achieving optimal performance on the latest multi-core and many-core architectures increasingly depe...
Achieving optimal performance on the latest multi-core and many-core architectures depends more and ...
This paper studies the performance and energy consumption of several multi-core, multi-CPUs and many...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...
OP2/PyOP2: Framework for performance-portable parallel computations on unstructured meshesOP2/PyOP2:...
Many-Task Computing (MTC) is a common scenario for multiple parallel systems, such as cluster, grids...
OpenCL (Open Computing Language) is a heterogeneous programming framework for developing application...
Recent developments in processor architecture have settled a shift from sequential processing to par...
This paper presents a performance analysis and benchmark-ing study of the OP2 “active ” library, whi...
OP2 is an "active " library framework for the solution of unstructured mesh-based applicat...
OP2 is an “active ” library framework for the solution of unstructured mesh applications. It aims to...
OP2 is an "active" library framework for the solution of unstructured mesh applications. It aims to ...
Hydra is a full-scale industrial CFD application used for the design of turbomachinery at Rolls Royc...
Applications based on unstructured meshes are typically compute intensive, leading to long running t...
Hydra is a full-scale industrial CFD application used for the design of turbomachinery at Rolls Royc...
Achieving optimal performance on the latest multi-core and many-core architectures increasingly depe...
Achieving optimal performance on the latest multi-core and many-core architectures depends more and ...
This paper studies the performance and energy consumption of several multi-core, multi-CPUs and many...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...
OP2/PyOP2: Framework for performance-portable parallel computations on unstructured meshesOP2/PyOP2:...
Many-Task Computing (MTC) is a common scenario for multiple parallel systems, such as cluster, grids...
OpenCL (Open Computing Language) is a heterogeneous programming framework for developing application...
Recent developments in processor architecture have settled a shift from sequential processing to par...