This paper describes a new concept for the parallel implementation of functional languages on a network of processors. The implementation uses a special variant of annotated graph reduction [3]. Its main features are the following. We employ active waiting [6] to avoid complicated runtime data structures. We use a global address space and a random distribution of the graph nodes over the local memories of the processors in order to overcome the problems of load balancing and scheduling. The reduction is organized in cycles, during each of which all annotated redexes are reduced. This notion of "cycles" enables us to restrict communication between the processors to the execution of a global permutation, defined by an array of mes...
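The cycle-based scheme described above can be illustrated with a small sketch. This is not the paper's implementation: the function names `place_nodes` and `run_cycle`, the message format, and the callback `reduce_node` are all hypothetical, and the bulk exchange at the end of the cycle only stands in for the global permutation that the abstract mentions, whose details are cut off in the text.

```python
import random
from collections import defaultdict


def place_nodes(num_nodes, num_procs, seed=0):
    """Assign each global node address to a processor chosen uniformly at random.

    Hypothetical helper: owner[addr] is the processor holding node addr.
    """
    rng = random.Random(seed)
    return [rng.randrange(num_procs) for _ in range(num_nodes)]


def run_cycle(owner, annotated, reduce_node):
    """Run one reduction cycle over all annotated nodes.

    reduce_node(addr) is an assumed callback that reduces one node and
    returns a list of (target_addr, payload) messages for other nodes.
    Messages are only delivered after every annotated node has been
    reduced, so communication happens once per cycle.
    """
    outbox = defaultdict(list)  # (source proc, destination proc) -> messages
    for addr in sorted(annotated):
        for target, payload in reduce_node(addr):
            outbox[(owner[addr], owner[target])].append((target, payload))

    # Single communication step at the end of the cycle.
    inbox = defaultdict(list)  # destination proc -> messages received
    for (_, dst), messages in outbox.items():
        inbox[dst].extend(messages)
    return dict(inbox)
```

With random placement, no scheduler decides where work goes: each processor simply reduces the annotated nodes it happens to own, and the expected load is even when the graph is large compared with the number of processors.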
An extensible machine architecture is devised to efficiently support a parallel reduction model of c...
A clustered architecture has been designed to exploit divide and conquer parallelism in functional p...
This thesis describes a model for distributed graph reduction implemented on a network of transputer...
Many models for the parallel reduction of lazy functional languages have been proposed in the litera...
Programmed graph reduction has been shown to be an efficient implementation technique for lazy funct...
Distributed implementations of programming languages with implicit parallelism hold out the prospect...
Technical report. An abstract machine suitable for parallel graph reduction on a shared memory multipr...
We describe the use and distributed implementation of a functional language based on the graph-reduc...
A fundamental issue affecting the performance of a parallel application running on message-passing p...
A language for a parallel, logic graph reduction system is described in this dissertation. The langu...
This paper describes a computer architecture for execution of lazy functional languages. The archite...
Load balancing in large parallel systems with distributed memory is a difficult task often influenci...
Journal article. This paper describes an effective means for programming shared memory multiprocessors...
A method is outlined for optimising graph partitions which arise in mapping unstructured mesh calcul...