This is a post-peer-review, pre-copyedit version of an article published in Lecture Notes in Computer Science. The final authenticated version is available online at: https://doi.org/10.1007/978-3-540-30566-8_4[Abstract] A loop with irregular assignment computations contains loop-carried output data dependences that can only be detected at run-time. In this paper, a load-balanced method based on the inspector-executor model is proposed to parallelize this loop pattern. The basic idea lies in splitting the iteration space of the sequential loop into sets of conflict-free iterations that can be executed concurrently on different processors. As will be demonstrated, this method outperforms existing techniques. Irregular access patterns with di...
[Abstract] The parallelization of irregular algorithms has not been as widely studied as the one of ...
An important class of scientific codes access memory in an irregular manner. Because irregular acce...
In this paper we explore the idea of customizing and reusing loop schedules to improve the scalabili...
Abstract. A loop with irregular assignment computations contains loopcarried output data dependences...
This is a post-peer-review, pre-copyedit version of an article published. The final authenticated ve...
Abstract—Parallelization and locality optimization of affine loop nests has been successfully addres...
When the inter-iteration dependency pattern of the iterations of a loop cannot be determined statica...
[[abstract]]A run-time technique based on the inspector-executor scheme is proposed in this paper to...
In this paper, we present two new run-time algorithms for the parallelization of loops that have ind...
Optimistic parallelization is a promising approach for the parallelization of irregular algorithms: ...
The class of problems that can be effectively compiled by parallelizing compilers is discussed. This...
In adaptive irregular problems the data arrays are accessed via indirection arrays, and data access ...
Current parallelizing compilers do a reasonable job of extracting parallelism from programs with reg...
this article we investigate the trade-off between time and space efficiency in scheduling and execut...
Parallel computing hardware is ubiquitous, ranging from cell-phones with multiple cores to super-com...
[Abstract] The parallelization of irregular algorithms has not been as widely studied as the one of ...
An important class of scientific codes access memory in an irregular manner. Because irregular acce...
In this paper we explore the idea of customizing and reusing loop schedules to improve the scalabili...
Abstract. A loop with irregular assignment computations contains loopcarried output data dependences...
This is a post-peer-review, pre-copyedit version of an article published. The final authenticated ve...
Abstract—Parallelization and locality optimization of affine loop nests has been successfully addres...
When the inter-iteration dependency pattern of the iterations of a loop cannot be determined statica...
[[abstract]]A run-time technique based on the inspector-executor scheme is proposed in this paper to...
In this paper, we present two new run-time algorithms for the parallelization of loops that have ind...
Optimistic parallelization is a promising approach for the parallelization of irregular algorithms: ...
The class of problems that can be effectively compiled by parallelizing compilers is discussed. This...
In adaptive irregular problems the data arrays are accessed via indirection arrays, and data access ...
Current parallelizing compilers do a reasonable job of extracting parallelism from programs with reg...
this article we investigate the trade-off between time and space efficiency in scheduling and execut...
Parallel computing hardware is ubiquitous, ranging from cell-phones with multiple cores to super-com...
[Abstract] The parallelization of irregular algorithms has not been as widely studied as the one of ...
An important class of scientific codes access memory in an irregular manner. Because irregular acce...
In this paper we explore the idea of customizing and reusing loop schedules to improve the scalabili...