Optimizations are considered that are required for efficient execution of code segments that consists of loops over distributed data structures. The PARTI (Parallel Automated Runtime Toolkit at ICASE) execution time primitives are designed to carry out these optimizations and can be used to implement a wide range of scientific algorithms on distributed memory machines. These primitives allow the user to control array mappings in a way that gives an appearance of shared memory. Computations can be based on a global index set. Primitives are used to carry out gather and scatter operations on distributed arrays. Communications patterns are derived at runtime, and the appropriate send and receive messages are automatically generated
For better utilization of computing resources, it is important to consider parallel programming en...
The parallelization of several applications result in unstructured data accesses on coarse-grained, ...
In this paper we present several algorithms for performing all-to-many personalized communication on...
We consider optimizations that are required for efficient execution of code segments that consists o...
Primitives are presented that are designed to help users efficiently program irregular problems (e.g...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Primitives are presented that are designed to help users efficiently program irregular problems (e.g...
A compiler and runtime support mechanism is described and demonstrated. The methods presented are ca...
In adaptive irregular problems the data arrays are accessed via indirection arrays, and data access ...
Sparse system solvers and general purpose codes for solving partial differential equations are examp...
In adaptive irregular problems the data arrays are accessed via indirection arrays, and data access ...
In many scientific applications, arrays containing data are indirectly indexed through indirection a...
Sparse matrix-vector (SpMV) multiplication is a widely used kernel in scientific applications. In th...
In highly adaptive irregular problems such as many Particle-In-Cell (PICJ codes and Dimet Simulatio...
This paper describes a set of primitives (PARTI) developed to efficiently execute unstructured and b...
For better utilization of computing resources, it is important to consider parallel programming en...
The parallelization of several applications result in unstructured data accesses on coarse-grained, ...
In this paper we present several algorithms for performing all-to-many personalized communication on...
We consider optimizations that are required for efficient execution of code segments that consists o...
Primitives are presented that are designed to help users efficiently program irregular problems (e.g...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Primitives are presented that are designed to help users efficiently program irregular problems (e.g...
A compiler and runtime support mechanism is described and demonstrated. The methods presented are ca...
In adaptive irregular problems the data arrays are accessed via indirection arrays, and data access ...
Sparse system solvers and general purpose codes for solving partial differential equations are examp...
In adaptive irregular problems the data arrays are accessed via indirection arrays, and data access ...
In many scientific applications, arrays containing data are indirectly indexed through indirection a...
Sparse matrix-vector (SpMV) multiplication is a widely used kernel in scientific applications. In th...
In highly adaptive irregular problems such as many Particle-In-Cell (PICJ codes and Dimet Simulatio...
This paper describes a set of primitives (PARTI) developed to efficiently execute unstructured and b...
For better utilization of computing resources, it is important to consider parallel programming en...
The parallelization of several applications result in unstructured data accesses on coarse-grained, ...
In this paper we present several algorithms for performing all-to-many personalized communication on...