Several methods have been proposed in the literature for the distribution of data on distributed memory machines, either oriented to dense or sparse structures. Many of the real applications, however, deal with both kind of data jointly. This paper presents techniques for integrating dense and sparse array accesses in a way that optimizes locality and further allows an efficient loop partitioning within a data-parallel compiler. Our approach is evaluated through an experimental survey with several compilers and parallel platforms. The results prove the benefits of the BRS sparse distribution when combined with CYCLIC in mixed algorithms and the poor efficiency achieved by well-known distribution schemes when sparse elements arise in the sou...
A compiler and runtime support mechanism is described and demonstrated. The methods presented are ca...
In this paper, we develop an automatic compile-time computation and data decomposition technique for...
[[abstract]]©1997 SIAM-We present a compile-time method to select compression and distribution schem...
[[abstract]]A data distribution scheme of sparse arrays on a distributed memory multicomputer, in ge...
A data distribution scheme of sparse arrays on a distributed memory multicomputer, in general, is co...
[[abstract]]©2002 IEEE-A data distribution scheme of sparse arrays on a distributed memory multicomp...
Sparse matrix problems are difficult to parallelize efficiently on distributed memory machines since...
This paper proposes a new approach to improve data-parallel languages in the context of sparse and i...
[[abstract]]Fortran 90 provides a rich set of array intrinsic functions that are useful for represen...
In scalable multiprocessor systems, high performance demands that computational load be balanced eve...
[[abstract]]©1999 IEEE-In our recent work, we have been working on providing parallel sparse support...
Data-parallel languages, such as High Performance Fortran, are designed to make programming of distr...
Distributed-memory message-passing machines deliver scalable perfor-mance but are difficult to progr...
This paper outlines two methods which we believe will play an important role in any distributed memo...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
A compiler and runtime support mechanism is described and demonstrated. The methods presented are ca...
In this paper, we develop an automatic compile-time computation and data decomposition technique for...
[[abstract]]©1997 SIAM-We present a compile-time method to select compression and distribution schem...
[[abstract]]A data distribution scheme of sparse arrays on a distributed memory multicomputer, in ge...
A data distribution scheme of sparse arrays on a distributed memory multicomputer, in general, is co...
[[abstract]]©2002 IEEE-A data distribution scheme of sparse arrays on a distributed memory multicomp...
Sparse matrix problems are difficult to parallelize efficiently on distributed memory machines since...
This paper proposes a new approach to improve data-parallel languages in the context of sparse and i...
[[abstract]]Fortran 90 provides a rich set of array intrinsic functions that are useful for represen...
In scalable multiprocessor systems, high performance demands that computational load be balanced eve...
[[abstract]]©1999 IEEE-In our recent work, we have been working on providing parallel sparse support...
Data-parallel languages, such as High Performance Fortran, are designed to make programming of distr...
Distributed-memory message-passing machines deliver scalable perfor-mance but are difficult to progr...
This paper outlines two methods which we believe will play an important role in any distributed memo...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
A compiler and runtime support mechanism is described and demonstrated. The methods presented are ca...
In this paper, we develop an automatic compile-time computation and data decomposition technique for...
[[abstract]]©1997 SIAM-We present a compile-time method to select compression and distribution schem...