Automatic parallelization is usually believed to be less effective at exploiting implicit parallelism in sparse/irregular programs than in their dense/regular counterparts. However, not much is really known because there have been few research reports on this topic. In this work, we have studied the possibility of using an automatic parallelizing compiler to detect the parallelism in sparse/irregular programs. The study with a collection of sparse/irregular programs led us to some common loop patterns. Based on these patterns new techniques were derived that produced good speedups when manually applied to our benchmark codes. More importantly, these parallelization methods can be implemented in a parallelizing compiler and can be applied au...
[[abstract]]©1999 IEEE-In our recent work, we have been working on providing parallel sparse support...
A large class of scientific and engineering applications may be classified as irregular and loosely ...
[[abstract]]Fortran 90 provides a rich set of array intrinsic functions that are useful for represen...
This paper studies howwell automatic parallelization techniques work on a collection of real codes w...
153 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2000.We have studied five differen...
147 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1997.The study of theoretical and ...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...
This paper proposes a new approach to improve data-parallel languages in the context of sparse and i...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
International audienceIn this paper, we propose a generic method of automatic parallelization for sp...
This paper presents several synthetic Fortran codes for testing the ability of automatic parallelize...
Thesis (Ph. D.--University of Rochester. Dept. of Computer Science, 1991. Simultaneously published i...
Abstract. This paper presents a compilation technique that performs automatic parallelization of can...
In this paper, we have presented the design and evalu-ation of a compiler system, called APE, f o r ...
[[abstract]]©1999 IEEE-In our recent work, we have been working on providing parallel sparse support...
A large class of scientific and engineering applications may be classified as irregular and loosely ...
[[abstract]]Fortran 90 provides a rich set of array intrinsic functions that are useful for represen...
This paper studies howwell automatic parallelization techniques work on a collection of real codes w...
153 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2000.We have studied five differen...
147 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1997.The study of theoretical and ...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...
This paper proposes a new approach to improve data-parallel languages in the context of sparse and i...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
International audienceIn this paper, we propose a generic method of automatic parallelization for sp...
This paper presents several synthetic Fortran codes for testing the ability of automatic parallelize...
Thesis (Ph. D.--University of Rochester. Dept. of Computer Science, 1991. Simultaneously published i...
Abstract. This paper presents a compilation technique that performs automatic parallelization of can...
In this paper, we have presented the design and evalu-ation of a compiler system, called APE, f o r ...
[[abstract]]©1999 IEEE-In our recent work, we have been working on providing parallel sparse support...
A large class of scientific and engineering applications may be classified as irregular and loosely ...
[[abstract]]Fortran 90 provides a rich set of array intrinsic functions that are useful for represen...