In this paper we explore the possibility of reusing schedules to improve the scalability of numerical codes in shared--memory architectures with non--uniform memory access. The main objective is to implicitly construct affinity links between threads and data accesses and reuse them as much as possible along the execution of the application. These links are created thorugh the definition and reuse of iteration schedules statically defined by the user or dinamically created at run time. The paper does not include a formal proposal of OpenMP extensions but includes some experiments showing the usefulness of constructing affinity links in some irregular codes.Peer Reviewe
This paper compares data distribution methodologies for scaling the performance of OpenMP on NUMA ar...
Scientific applications, like the ones involving numerical simulations, keep requiring more and more...
In recent years parallel computing has become ubiquitous. Lead by the spread of commodity multicore ...
In this paper we explore the idea of customizing and reusing loop schedules to improve the scalabili...
In this paper we explore the idea of customizing and reusing loop schedules to improve the scalabili...
Choosing the appropriate assignment of loop iterations to threads is one of the most important decis...
OpenMP has emerged as an important model and language extension for shared-memory parallel programmi...
Abstract. Thread affinity has appeared as an important technique to improve the overall program perf...
In this paper, we present the compiler transformation of OpenMP code to an ordered collection of tas...
Locality of computation is key to obtaining high performance on a broad variety of parallel architec...
Abstract. Nowadays shared memory HPC platforms expose a large number of cores organized in a hierarc...
In prior work, we have proposed techniques to extend the ease of shared-memory parallel programming ...
Shared memory parallel programming, for instance by inserting OpenMP pragmas into program code, migh...
This paper presents a set of proposals for the OpenMP shared-- memory programming model oriented to...
In prior work, we have proposed techniques to extend the ease of shared-memory parallel programming ...
This paper compares data distribution methodologies for scaling the performance of OpenMP on NUMA ar...
Scientific applications, like the ones involving numerical simulations, keep requiring more and more...
In recent years parallel computing has become ubiquitous. Lead by the spread of commodity multicore ...
In this paper we explore the idea of customizing and reusing loop schedules to improve the scalabili...
In this paper we explore the idea of customizing and reusing loop schedules to improve the scalabili...
Choosing the appropriate assignment of loop iterations to threads is one of the most important decis...
OpenMP has emerged as an important model and language extension for shared-memory parallel programmi...
Abstract. Thread affinity has appeared as an important technique to improve the overall program perf...
In this paper, we present the compiler transformation of OpenMP code to an ordered collection of tas...
Locality of computation is key to obtaining high performance on a broad variety of parallel architec...
Abstract. Nowadays shared memory HPC platforms expose a large number of cores organized in a hierarc...
In prior work, we have proposed techniques to extend the ease of shared-memory parallel programming ...
Shared memory parallel programming, for instance by inserting OpenMP pragmas into program code, migh...
This paper presents a set of proposals for the OpenMP shared-- memory programming model oriented to...
In prior work, we have proposed techniques to extend the ease of shared-memory parallel programming ...
This paper compares data distribution methodologies for scaling the performance of OpenMP on NUMA ar...
Scientific applications, like the ones involving numerical simulations, keep requiring more and more...
In recent years parallel computing has become ubiquitous. Lead by the spread of commodity multicore ...