In this paper, we present an alternative implementation of the NANOS OpenMP runtime library (NthLib) that targets portability and efficient support of multiple levels of parallelism. We have implemented the runtime libraries of available open-source OpenMP compilers on top of NthLib, reducing thus their overheads and providing them with inherent support for nested parallelism. In addition, we present an experimental implementation of the workqueuing model and the parallelization of a data clustering algorithm using OpenMP directives. The asymmetry and non-determinism of this algorithm necessitate the exploitation of its nested loop-level parallelism. The experimental results on a SMP server with four processors demonstrate our efficient Ope...
In this paper we describe the extension of the CAPO parallelization support tool to support multilev...
This paper presents a new parallel programming environment called ParADE to enable easy, portable, ...
International audienceThis paper presents a high performance parallel implementation of a hierarchic...
In this paper, we present an alternative implementation of the NANOS OpenMP runtime library (NthLib)...
This paper describes the support provided by the NanosCompiler to nested parallelism in OpenMP. The ...
In this paper we describe the main components of the NanosCompiler, an OpenMP compiler whose impleme...
This paper describes the support provided by the NanosCompiler to nested parallelism in OpenMP. The ...
This paper presents a set of proposals for the OpenMP shared--memory programming model oriented tow...
This paper presents a set of proposals for the OpenMP shared-- memory programming model oriented to...
The OpenMP Application Programming Interface (API) is an emerging standard for parallel programming ...
In this work, we present an OpenMP implementation suitable for multiprogrammed environments on Intel...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/18...
OpenMP is attracting wide-spread interest because of its easy-to-use parallel programming model for ...
In this paper, we show the potential benefits of translating OpenMP code to low-level parallel code ...
In this paper we describe the extension of the CAPO parallelization support tool to support multilev...
In this paper we describe the extension of the CAPO parallelization support tool to support multilev...
This paper presents a new parallel programming environment called ParADE to enable easy, portable, ...
International audienceThis paper presents a high performance parallel implementation of a hierarchic...
In this paper, we present an alternative implementation of the NANOS OpenMP runtime library (NthLib)...
This paper describes the support provided by the NanosCompiler to nested parallelism in OpenMP. The ...
In this paper we describe the main components of the NanosCompiler, an OpenMP compiler whose impleme...
This paper describes the support provided by the NanosCompiler to nested parallelism in OpenMP. The ...
This paper presents a set of proposals for the OpenMP shared--memory programming model oriented tow...
This paper presents a set of proposals for the OpenMP shared-- memory programming model oriented to...
The OpenMP Application Programming Interface (API) is an emerging standard for parallel programming ...
In this work, we present an OpenMP implementation suitable for multiprogrammed environments on Intel...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/18...
OpenMP is attracting wide-spread interest because of its easy-to-use parallel programming model for ...
In this paper, we show the potential benefits of translating OpenMP code to low-level parallel code ...
In this paper we describe the extension of the CAPO parallelization support tool to support multilev...
In this paper we describe the extension of the CAPO parallelization support tool to support multilev...
This paper presents a new parallel programming environment called ParADE to enable easy, portable, ...
International audienceThis paper presents a high performance parallel implementation of a hierarchic...