Efficient data motion is critical for high performance computing on distributed memory architectures. The value of some techniques for efficient data motion is illustrated by identifying generic communication primitives. Further, the efficiency of these primitives is demonstrated on three different applications using the finite element method for unstructured grids and sparse solvers with different communication requirements. For the applications presented, the techniques advocated reduced the communication times by a factor of between 1.5 - 3.Engineering and Applied Science
One of the concerns often expressed about the use of object oriented methods in numerical software i...
The motivation of this thesis was to develop strategies that would enable unstructured mesh based co...
Explicit finite-element methods are widely used for problems of non-linear structural mechanics. Bec...
Massively parallel computing holds the promise of extreme performance. Critical for achieving high p...
This paper demonstrates that scalability and competitive efficiency can be achieved for unstructured...
AbstractWe discuss some aspects of implementing the finite-element method on parallel computers with...
Abstract. In this paper we address various efficiency aspects of finite element (FE) simulations on ...
This paper describes the methods used and experiences made with implementing a finite element applic...
This paper describes the methods used and experiences made with implementing a finite element applic...
Performance of finite element solvers on parallel computers such as the Connection Machine CM-5 syst...
Processor technology is still dramatically advancing and promises enormous improvements in processin...
In parallel finite element solvers, sparse matrix assembly is often a bottleneck. Implemented using ...
This paper highlights a three-year project by an interdisciplinary team on a legacy F77 computationa...
A new, efficient pipelined communication strategy that significantly reduces latency for parallel 3-...
This report characterizes a family of unstructured 3D finite element simulations that are partitione...
One of the concerns often expressed about the use of object oriented methods in numerical software i...
The motivation of this thesis was to develop strategies that would enable unstructured mesh based co...
Explicit finite-element methods are widely used for problems of non-linear structural mechanics. Bec...
Massively parallel computing holds the promise of extreme performance. Critical for achieving high p...
This paper demonstrates that scalability and competitive efficiency can be achieved for unstructured...
AbstractWe discuss some aspects of implementing the finite-element method on parallel computers with...
Abstract. In this paper we address various efficiency aspects of finite element (FE) simulations on ...
This paper describes the methods used and experiences made with implementing a finite element applic...
This paper describes the methods used and experiences made with implementing a finite element applic...
Performance of finite element solvers on parallel computers such as the Connection Machine CM-5 syst...
Processor technology is still dramatically advancing and promises enormous improvements in processin...
In parallel finite element solvers, sparse matrix assembly is often a bottleneck. Implemented using ...
This paper highlights a three-year project by an interdisciplinary team on a legacy F77 computationa...
A new, efficient pipelined communication strategy that significantly reduces latency for parallel 3-...
This report characterizes a family of unstructured 3D finite element simulations that are partitione...
One of the concerns often expressed about the use of object oriented methods in numerical software i...
The motivation of this thesis was to develop strategies that would enable unstructured mesh based co...
Explicit finite-element methods are widely used for problems of non-linear structural mechanics. Bec...