Programming a distributed system, such as a cluster, requires extensive use of low-level communication libraries and is often cumbersome and error-prone for the average developer. In this work, we treat each node of a cluster as a separate OpenMP device, able to run code with OpenMP directives in parallel. We use the OpenMP device model to provide an easy and intuitive way to program the available cluster nodes. Based on that, we examine the modifications that were necessary for existing task-based applications to exploit such an infrastructure. Finally, we evaluate the performance of the system and conclude that one can gain significant speedup, as long as the application tasks do not produce excessive communication over...
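To make the device-model idea concrete, the following is a minimal sketch in C using only standard OpenMP `target` directives. It assumes, purely for illustration, that each cluster node is reported as one OpenMP device; the helper names `device_count` and `scale_on_devices` are hypothetical and are not taken from the paper. When no devices are available, or when OpenMP is disabled, the loop simply runs on the host.

```c
#include <stddef.h>
#ifdef _OPENMP
#include <omp.h>
#endif

/* Number of offload devices visible to the program. Assumption: under the
   model described above, each cluster node would appear as one device. */
static int device_count(void) {
#ifdef _OPENMP
    return omp_get_num_devices();
#else
    return 0;
#endif
}

/* Hypothetical helper: multiply a[0..n) by factor, giving each device one
   contiguous chunk. Chunks are dispatched one after another here. */
void scale_on_devices(double *a, size_t n, double factor) {
    int ndev = device_count();
    int chunks = ndev > 0 ? ndev : 1;            /* fall back to a single chunk */
    size_t chunk = (n + (size_t)chunks - 1) / (size_t)chunks;

    for (int d = 0; d < chunks; ++d) {
        size_t lo = (size_t)d * chunk;
        size_t hi = lo + chunk < n ? lo + chunk : n;
        if (lo >= hi)
            continue;                            /* nothing left for this device */

        double *part = a + lo;
        size_t len = hi - lo;

        /* Copy the chunk to device d, update it there, and copy it back. */
        #pragma omp target teams distribute parallel for device(d) map(tofrom: part[0:len])
        for (size_t i = 0; i < len; ++i)
            part[i] *= factor;
    }
}
```

Each `map(tofrom: ...)` clause implies a transfer to and from the target device, which is the kind of communication cost the abstract warns about: the less data each task moves relative to the work it does, the more speedup one can expect.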
This paper presents a new parallel programming environment called ParADE to enable easy, portable, ...
This paper advances the state-of-the-art in programming models for exploiting task-level parallelism...
OpenMP was not well recognized at the beginning of the project, around year 2003, because of its lim...
With the introduction of more powerful and massively parallel embedded processors, embedded systems ...
Cluster-based architectures are increasingly being adopted to design embedded many-cores. These plat...
Clusters of SMPs are ubiquitous. They have been traditionally programmed by using MPI. But, the prod...
Cluster-based architectures are increasingly being adopted to design embedded many-cores. These plat...
The most widely used node type in high-performance computing nowadays is a 2-socket server node. The...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/18...
Tasking promises a model to program parallel applications that provides intuitive semantics. In the ...
Modern petascale and future exascale systems are massively heterogeneous architectures. Developing p...
MPI is the predominant model for parallel programming in technical high performance computing. With ...
This paper advances the state-of-the-art in programming models for exploiting task-level parallelism...
OpenMP has been very successful in exploiting structured parallelism in applications. With increasin...
OpenMP, as the de-facto standard programming model in symmetric multiprocessing for HPC, has seen it...