We report on the development of a new computational framework for efficiently carrying out parallel data redistribution in a limited memory environment. This new library, MADRE (The Memory-Aware Data Redistribution Engine), is an open-source, C/MPI-based toolkit de-signed for quick and easy integration into application codes that have demanding data migration needs. At the same time, MADRE exposes a lower-level API that greatly facilitates the devel-opment and incorporation of new algorithms into the MADRE framework, thus serving as a potential organizing entity for continued research in this area. Finally, we develop, describe, and test in detail several new parallel redistribution algorithms that are incorporated into the MADRE distributi...
Contemporary state-of-the-art runtime systems underlying widely utilized general purpose parallel pr...
This paper outlines the strategy and methodology used to create a portable coding environment for th...
This paper describes a framework for providing the ability to use multiple specialized data parallel...
As embedded systems are becoming the center of our digital life, system design becomes progressively...
International audienceIn this paper we concentrate on embedded parallel architectures with heterogen...
Applications are increasingly being executed on computational systems that have hierarchical paralle...
The goal of the Pandore system is to allow the execution of parallel algorithms on DMPC (Distributed...
When coupling applications running on distributed memory architectures or clusters, the coupling lib...
International audienceThis paper describes dstep, a directive-based programming model for hybrid sha...
Advances in computing and networking infrastructure have enabled an increasing number of application...
Shared-memory multiprocessor systems can achieve high performance levels when appropriate work paral...
Currently, most scientific applications based on MPI adopt a compute-centric architecture. Needed da...
The processing of massive amounts of data on clusters with finite amount of memory has become an imp...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
Contemporary state-of-the-art runtime systems underlying widely utilized general purpose parallel pr...
This paper outlines the strategy and methodology used to create a portable coding environment for th...
This paper describes a framework for providing the ability to use multiple specialized data parallel...
As embedded systems are becoming the center of our digital life, system design becomes progressively...
International audienceIn this paper we concentrate on embedded parallel architectures with heterogen...
Applications are increasingly being executed on computational systems that have hierarchical paralle...
The goal of the Pandore system is to allow the execution of parallel algorithms on DMPC (Distributed...
When coupling applications running on distributed memory architectures or clusters, the coupling lib...
International audienceThis paper describes dstep, a directive-based programming model for hybrid sha...
Advances in computing and networking infrastructure have enabled an increasing number of application...
Shared-memory multiprocessor systems can achieve high performance levels when appropriate work paral...
Currently, most scientific applications based on MPI adopt a compute-centric architecture. Needed da...
The processing of massive amounts of data on clusters with finite amount of memory has become an imp...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
Contemporary state-of-the-art runtime systems underlying widely utilized general purpose parallel pr...
This paper outlines the strategy and methodology used to create a portable coding environment for th...
This paper describes a framework for providing the ability to use multiple specialized data parallel...