Reordering of data is becoming more and more significant in order to achieve a higher performance in memory data access and, particularly, in program runtime. This fact be-comes specially important in parallel applications that are executed in shared memory systems. This work presents a new parallelizing, run time strategy for irregular structures asso-ciated to N-Body problem simulation algorithms. Such stra-tegy, so-called STPCLS (Step Classification), is based on the inspector-executor paradigm. It has been tested in a shared memory system using a significant set of irregular loops. The outcomes show that the efficiency of our solution is high, and the benefits overcome the overheads imposed by our algo-rithm. 1
We investigate conservative parallel discrete event simulations for logical circuits on shared-memor...
With traditional event list techniques, evaluating a detailed discrete event simulation model can of...
In this paper, we present two new parallel formulations of the Barnes-Hut method. These parallel for...
Abstract—Parallelization and locality optimization of affine loop nests has been successfully addres...
Parallel computing promises several orders of magnitude increase in our ability to solve realistic c...
Parallel computing hardware is ubiquitous, ranging from cell-phones with multiple cores to super-com...
textabstractParallel computation offers a challenging opportunity to speed up the time consuming enu...
this article we investigate the trade-off between time and space efficiency in scheduling and execut...
Abstract. A loop with irregular assignment computations contains loopcarried output data dependences...
[[abstract]]Circuit simulation is a very time-consuming and numerically intensive application, espec...
Discrete-event simulation, which is a major tool for analysis, prediction, and training, has been de...
PhD ThesisThis thesis develops and evaluates a number of efficient algorithms for performing paralle...
This is a post-peer-review, pre-copyedit version of an article published in Lecture Notes in Compute...
[[abstract]]A run-time technique based on the inspector-executor scheme is proposed in this paper to...
Simulation is a powerful technique to represent the evolution of realworld phenomena or systems ove...
We investigate conservative parallel discrete event simulations for logical circuits on shared-memor...
With traditional event list techniques, evaluating a detailed discrete event simulation model can of...
In this paper, we present two new parallel formulations of the Barnes-Hut method. These parallel for...
Abstract—Parallelization and locality optimization of affine loop nests has been successfully addres...
Parallel computing promises several orders of magnitude increase in our ability to solve realistic c...
Parallel computing hardware is ubiquitous, ranging from cell-phones with multiple cores to super-com...
textabstractParallel computation offers a challenging opportunity to speed up the time consuming enu...
this article we investigate the trade-off between time and space efficiency in scheduling and execut...
Abstract. A loop with irregular assignment computations contains loopcarried output data dependences...
[[abstract]]Circuit simulation is a very time-consuming and numerically intensive application, espec...
Discrete-event simulation, which is a major tool for analysis, prediction, and training, has been de...
PhD ThesisThis thesis develops and evaluates a number of efficient algorithms for performing paralle...
This is a post-peer-review, pre-copyedit version of an article published in Lecture Notes in Compute...
[[abstract]]A run-time technique based on the inspector-executor scheme is proposed in this paper to...
Simulation is a powerful technique to represent the evolution of realworld phenomena or systems ove...
We investigate conservative parallel discrete event simulations for logical circuits on shared-memor...
With traditional event list techniques, evaluating a detailed discrete event simulation model can of...
In this paper, we present two new parallel formulations of the Barnes-Hut method. These parallel for...