The next generation Sunway supercomputer employs the SW26010pro processor, which features a specialized on-chip heterogeneous architecture. Applications with significant hotspots can benefit from the great computation capacity improvement of Sunway many-core architectures by carefully making intensive manual many-core parallelization efforts. However, some legacy projects with large codebases, such as CESM, ROMS and WRF, contain numerous lines of code and do not have significant hotspots. The cost of manually porting such applications to the Sunway architecture is almost unaffordable. To overcome such a challenge, we have developed a toolkit named O2ATH. O2ATH forwards GNU OpenMP runtime library calls to Sunway's Athread library, which grea...
This paper presents HALO 1.0, an open-ended extensible multi-agent software framework that implement...
AbstractThe Open Community Runtime (OCR) is a recent effort in the search for a runtime for extreme ...
OpenMP provides a portable programming interface for shared memory parallel computers (SMPs). Althou...
With the introduction of more powerful and massively parallel embedded processors, embedded systems ...
The end of Dennard scaling and the slowdown of Moore's law led to a shift in technology trends towar...
Heterogeneous multicores like GPGPUs are now commonplace in modern computing systems. Although heter...
With the introduction of more powerful and massively parallel embedded processors, embedded systems ...
The Sunway TaihuLight supercomputer is the world's first system with a peak performance greater ...
Six months of HECToR dCSE funding was given to implement mixed-mode OpenMP parallelism in CP2K, buil...
Modern petascale and future exascale systems are massively heterogeneous architectures. Developing p...
The Sunway TaihuLight supercomputer is the world's first system with a peak performance greater than...
open5noopenMontagna, Fabio; Tagliavini, Giuseppe; Rossi, Davide; Garofalo, Angelo; Benini, LucaMonta...
We propose a novel computing runtime that exposes remote compute devices via the cross-vendor open h...
This paper reports on the development of an MPI/OpenCL implementation of LU, an application-level be...
he cloud microphysics scheme, CASIM, and the radiation scheme, SOCRATES, are two computationally int...
This paper presents HALO 1.0, an open-ended extensible multi-agent software framework that implement...
AbstractThe Open Community Runtime (OCR) is a recent effort in the search for a runtime for extreme ...
OpenMP provides a portable programming interface for shared memory parallel computers (SMPs). Althou...
With the introduction of more powerful and massively parallel embedded processors, embedded systems ...
The end of Dennard scaling and the slowdown of Moore's law led to a shift in technology trends towar...
Heterogeneous multicores like GPGPUs are now commonplace in modern computing systems. Although heter...
With the introduction of more powerful and massively parallel embedded processors, embedded systems ...
The Sunway TaihuLight supercomputer is the world's first system with a peak performance greater ...
Six months of HECToR dCSE funding was given to implement mixed-mode OpenMP parallelism in CP2K, buil...
Modern petascale and future exascale systems are massively heterogeneous architectures. Developing p...
The Sunway TaihuLight supercomputer is the world's first system with a peak performance greater than...
open5noopenMontagna, Fabio; Tagliavini, Giuseppe; Rossi, Davide; Garofalo, Angelo; Benini, LucaMonta...
We propose a novel computing runtime that exposes remote compute devices via the cross-vendor open h...
This paper reports on the development of an MPI/OpenCL implementation of LU, an application-level be...
he cloud microphysics scheme, CASIM, and the radiation scheme, SOCRATES, are two computationally int...
This paper presents HALO 1.0, an open-ended extensible multi-agent software framework that implement...
AbstractThe Open Community Runtime (OCR) is a recent effort in the search for a runtime for extreme ...
OpenMP provides a portable programming interface for shared memory parallel computers (SMPs). Althou...