Abstract—DENDRO [1] is a collection of tools for solving Finite Element problems in parallel. This package is written in C++ using the standard template library (STL) and uses the Message Passing (MPI) [2]. Dendro uses an octree data-structure to solve image-registration problems using finite element techniques. For analyzing the behavior of the package in terms of speed-up and scalability, it is important to know which part of the package is consuming most of the execution-time. The single node performance and the overall performance of the package is dependent on the code-organization and class-hierarchy. We used the PETSC [3] profiler to collect the performance statistics and instrument the code to know which part of the code takes most ...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
Abstract. Graphical Processing Units (GPUs) have shown acceleration factors over multicores for stru...
VAUCANSON is an extensive C++ library for the manipulation of finite state machines. Compared to its...
DENDRO is a collection of tools for solving Finite Element problems in parallel. This package is wri...
DENDRO is a collection of tools for solving Finite Element problems in parallel. This package is wri...
DENDRO is a collection of tools for solving Finite Element problems in parallel. This package is wri...
DENDRO is a collection of tools for solving Finite Element problems in parallel. This package is wri...
A normal Finite Element code typically takes about 30-40 % of the total time to cal-culate and assem...
The Portable, Extensible, Toolkit for Scientific Computation (PETSc) library package is a popular co...
This document outlines a simple method for benchmarking a parallel communication library and for usi...
Designers are confronted with high time-to-market pressure and an increasing demand for computationa...
With modern advancements in hardware and software technology scaling towards new limits, our compute...
pC++ is a language extension to C++ designed to allow programmers to compose distributed data struct...
Generally, parallel scientific applications are executed on a fixed number of processors determined ...
This thesis documents the analysis and optimization of a high-order finite difference computational ...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
Abstract. Graphical Processing Units (GPUs) have shown acceleration factors over multicores for stru...
VAUCANSON is an extensive C++ library for the manipulation of finite state machines. Compared to its...
DENDRO is a collection of tools for solving Finite Element problems in parallel. This package is wri...
DENDRO is a collection of tools for solving Finite Element problems in parallel. This package is wri...
DENDRO is a collection of tools for solving Finite Element problems in parallel. This package is wri...
DENDRO is a collection of tools for solving Finite Element problems in parallel. This package is wri...
A normal Finite Element code typically takes about 30-40 % of the total time to cal-culate and assem...
The Portable, Extensible, Toolkit for Scientific Computation (PETSc) library package is a popular co...
This document outlines a simple method for benchmarking a parallel communication library and for usi...
Designers are confronted with high time-to-market pressure and an increasing demand for computationa...
With modern advancements in hardware and software technology scaling towards new limits, our compute...
pC++ is a language extension to C++ designed to allow programmers to compose distributed data struct...
Generally, parallel scientific applications are executed on a fixed number of processors determined ...
This thesis documents the analysis and optimization of a high-order finite difference computational ...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
Abstract. Graphical Processing Units (GPUs) have shown acceleration factors over multicores for stru...
VAUCANSON is an extensive C++ library for the manipulation of finite state machines. Compared to its...