Task and/or data parallelism can be exploited in most applications on today's multi-core processors. However, inefficient data organization, data dependencies, and hardware constraints limit scalable parallelization of these applications. In this thesis, performance and the impact of some optimizations is compared and evaluated for simple tasks using two parallel frameworks, OpenMP and Galois. Additionally, their performance on three real life applications, High Accuracy Relativistic Magnetohydrodynamics (HARM) which operates on a grid data structure; Delaunay Triangulation, which refines a triangulated mesh; and Dynamic Fracture Propagation, which operates on a triangulated mesh with adaptive refinement; is evaluated. It is found that Ope...
AbstractThis paper presents a fine grain parallel version of the 3D Delaunay Kernel procedure using ...
This dissertation presents optimization techniques for efficient data parallel formulation/implement...
Dense linear algebra libraries need to cope efficiently with a range of input problem sizes and shap...
Task and/or data parallelism can be exploited in most applications on today's multi-core processors....
Abstract. Irregular applications, i.e., programs that manipulate pointer-based data structures such ...
textThe Galois system provides a solution to the hard problem of parallelizing irregular algorithms ...
Article dans revue scientifique avec comité de lecture.The parallelization of a sequentiel molecular...
The majority of current HPC applications are composed of complex and irregular data structures that ...
This work examines performance characteristics of multiple shared-memory implementations of a probab...
Abstract In the prospect of the upcoming exa-scale era with millions of execution units, the questio...
The last several years have seen multicore architectures become ascendant in the computing world. As...
In prior work, we have proposed techniques to extend the ease of shared-memory parallel programming ...
In prior work, we have proposed techniques to extend the ease of shared-memory parallel programming ...
The thesis offers a comparison of OpenMP and Intel Threading Building blocks. The two are threading ...
In this masters thesis we explore past work trying to classify algorithmic problems. These classicat...
AbstractThis paper presents a fine grain parallel version of the 3D Delaunay Kernel procedure using ...
This dissertation presents optimization techniques for efficient data parallel formulation/implement...
Dense linear algebra libraries need to cope efficiently with a range of input problem sizes and shap...
Task and/or data parallelism can be exploited in most applications on today's multi-core processors....
Abstract. Irregular applications, i.e., programs that manipulate pointer-based data structures such ...
textThe Galois system provides a solution to the hard problem of parallelizing irregular algorithms ...
Article dans revue scientifique avec comité de lecture.The parallelization of a sequentiel molecular...
The majority of current HPC applications are composed of complex and irregular data structures that ...
This work examines performance characteristics of multiple shared-memory implementations of a probab...
Abstract In the prospect of the upcoming exa-scale era with millions of execution units, the questio...
The last several years have seen multicore architectures become ascendant in the computing world. As...
In prior work, we have proposed techniques to extend the ease of shared-memory parallel programming ...
In prior work, we have proposed techniques to extend the ease of shared-memory parallel programming ...
The thesis offers a comparison of OpenMP and Intel Threading Building blocks. The two are threading ...
In this masters thesis we explore past work trying to classify algorithmic problems. These classicat...
AbstractThis paper presents a fine grain parallel version of the 3D Delaunay Kernel procedure using ...
This dissertation presents optimization techniques for efficient data parallel formulation/implement...
Dense linear algebra libraries need to cope efficiently with a range of input problem sizes and shap...