Abstract. This paper describes a combination of methods which make interprocedural data placement optimisation available to parallel libra-ries. We propose a delayed-evaluation, self-optimising (DESO) numerical library for a distributed-memory multicomputer. Delayed evaluation al-lows us to capture the control- ow of a user program from within the library at runtime, and to construct an optimised execution plan by propagating data placement constraints backwards through the DAG representing the computation to be performed. Our strategy for optimising data placements at runtime consists of an ecient representation for data distributions, a greedy optimisation al-gorithm, which because of delayed evaluation can take account of the full contex...
This thesis investigates methods of automatic allocation of processes to available processors in a g...
Many applications from scientific computing and physical simulations can benefit from a mixed task a...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Abstract. This paper shows how data placement optimisation tech-niques which are normally only found...
. We are developing a lazy, self-optimising parallel library of vector-matrix routines. The aim is ...
This thesis concerns techniques for efficient runtime optimisation of regular parallel programs that...
Abstract. We are developing a lazy, self-optimising parallel library of vector-matrix routines. The ...
DESOLibraries are "delayed evaluation, selfoptimising " parallel libraries of numerical r...
Available from British Library Document Supply Centre- DSC:DXN063301 / BLDSC - British Library Docum...
The performance of a High Performance Parallel or Distributed Computation depends heavily on minimiz...
How can we provide data where it is required and when it is required to the execution units of para...
While parallel programming is needed to solve large-scale scientific applications, it is more diffic...
Abstract. Data locality optimization in parallel systems is a non-trivial task. This task is typical...
Abstract — A well organized parallel application can accomplish better performance over sequential e...
Massively Parallel Processor systems provide the required computational power to solve most large sc...
This thesis investigates methods of automatic allocation of processes to available processors in a g...
Many applications from scientific computing and physical simulations can benefit from a mixed task a...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Abstract. This paper shows how data placement optimisation tech-niques which are normally only found...
. We are developing a lazy, self-optimising parallel library of vector-matrix routines. The aim is ...
This thesis concerns techniques for efficient runtime optimisation of regular parallel programs that...
Abstract. We are developing a lazy, self-optimising parallel library of vector-matrix routines. The ...
DESOLibraries are "delayed evaluation, selfoptimising " parallel libraries of numerical r...
Available from British Library Document Supply Centre- DSC:DXN063301 / BLDSC - British Library Docum...
The performance of a High Performance Parallel or Distributed Computation depends heavily on minimiz...
How can we provide data where it is required and when it is required to the execution units of para...
While parallel programming is needed to solve large-scale scientific applications, it is more diffic...
Abstract. Data locality optimization in parallel systems is a non-trivial task. This task is typical...
Abstract — A well organized parallel application can accomplish better performance over sequential e...
Massively Parallel Processor systems provide the required computational power to solve most large sc...
This thesis investigates methods of automatic allocation of processes to available processors in a g...
Many applications from scientific computing and physical simulations can benefit from a mixed task a...
This paper describes a number of optimizations that can be used to support the efficient execution o...