Abstract. We are developing a lazy, self-optimising parallel library of vector-matrix routines. The aim is to allow users to parallelise certain computationally expensive parts of numerical programs by simply link-ing with a parallel rather than sequential library of subroutines. The li-brary performs interprocedural data placement optimisation at runtime, which requires the optimiser itself to be very ecient. We achieve this rstly by working from aggregate loop nests which have been optimised in isolation, and secondly by using a carefully constructed mathemati-cal formulation for data distributions and the distribution requirements of library operators, which allows us largely to replace searching with calculation in our algorithm.
While parallel programming is needed to solve large-scale scientific applications, it is more diffic...
Two issues in linear algebra algorithms for multicomputers are addressed. First, how tounify paralle...
Abstract. Sparse matrix-vector multiplication forms the heart of iterative linear solvers used widel...
. We are developing a lazy, self-optimising parallel library of vector-matrix routines. The aim is ...
Abstract. This paper describes a combination of methods which make interprocedural data placement op...
DESOLibraries are "delayed evaluation, selfoptimising " parallel libraries of numerical r...
Runtime specialization optimizes programs based on partial information available only at run time. I...
Abstract. This paper shows how data placement optimisation tech-niques which are normally only found...
This thesis concerns techniques for efficient runtime optimisation of regular parallel programs that...
Runtime specialization optimizes programs based on partial infor-mation available only at run time. ...
Abstract—This paper presents a data layout optimization technique for sequential and parallel progra...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
Available from British Library Document Supply Centre- DSC:DXN063301 / BLDSC - British Library Docum...
The multiplication of a sparse matrix with a dense vector is a performance critical computational ke...
Data and computation alignment is an important part of compiling sequential programs to architecture...
While parallel programming is needed to solve large-scale scientific applications, it is more diffic...
Two issues in linear algebra algorithms for multicomputers are addressed. First, how tounify paralle...
Abstract. Sparse matrix-vector multiplication forms the heart of iterative linear solvers used widel...
. We are developing a lazy, self-optimising parallel library of vector-matrix routines. The aim is ...
Abstract. This paper describes a combination of methods which make interprocedural data placement op...
DESOLibraries are "delayed evaluation, selfoptimising " parallel libraries of numerical r...
Runtime specialization optimizes programs based on partial information available only at run time. I...
Abstract. This paper shows how data placement optimisation tech-niques which are normally only found...
This thesis concerns techniques for efficient runtime optimisation of regular parallel programs that...
Runtime specialization optimizes programs based on partial infor-mation available only at run time. ...
Abstract—This paper presents a data layout optimization technique for sequential and parallel progra...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
Available from British Library Document Supply Centre- DSC:DXN063301 / BLDSC - British Library Docum...
The multiplication of a sparse matrix with a dense vector is a performance critical computational ke...
Data and computation alignment is an important part of compiling sequential programs to architecture...
While parallel programming is needed to solve large-scale scientific applications, it is more diffic...
Two issues in linear algebra algorithms for multicomputers are addressed. First, how tounify paralle...
Abstract. Sparse matrix-vector multiplication forms the heart of iterative linear solvers used widel...