Abstract. Data-parallel programming facilitates elegant specification of concurrency. However, the composability of data-parallel operations so far has been constrained by the requirement to have only flat data-parallel operation at runtime. In this paper, we present early results on our work to exploit hardware support for nested concurrency to directly map nested data-parallel operations in high-level specifications to low-level codes that can be efficiently executed. To this effect, we have devised a compilation scheme from data-parallel operations in SaC to the sys-tems programming language of the Microgrid architecture. Furthermore, we present early empirical results to assert the viability of our approach.
Increased programmability for concurrent applications in distributed systems requires automatic supp...
Abstract: "Data-parallel programming languages have many desirable features, such as single-thread s...
This paper describes methods to adapt existing optimizing compilers for sequential languages to prod...
Data-parallel programming facilitates elegant specification of concurrency. However, the composabili...
We present preliminary results from compiling the high-level, functional and data-parallel programmi...
Contemporary parallel microprocessors exploit Chip Multiprocessing along with Single Instruction, Mu...
[[abstract]]A systematic procedure for designing pipelined data-parallel algorithms that are suitabl...
This paper describes an implementation technique for integrating nested data parallelism into an obj...
Development of parallel software is a very complex task. Many details, such as domain type, partitio...
Data-parallel languages, such as H scIGH P scERFORMANCE F scORTRAN or F scORTRAN D, provide a machin...
Data flow techniques have been around since the early ’70s when they were used in compilers for sequ...
The Sparc T3-4 server provides up to 512 concurrent hardware threads, a degree of concurrency that i...
Data flow techniques have been around since the early ’70s when they were used in compilers for seq...
In previous work, we have proposed a multithreaded execution model for describing nested data-parall...
The Sparc T3-4 server provides up to 512 concurrent hardware threads, a degree of concurrency that i...
Increased programmability for concurrent applications in distributed systems requires automatic supp...
Abstract: "Data-parallel programming languages have many desirable features, such as single-thread s...
This paper describes methods to adapt existing optimizing compilers for sequential languages to prod...
Data-parallel programming facilitates elegant specification of concurrency. However, the composabili...
We present preliminary results from compiling the high-level, functional and data-parallel programmi...
Contemporary parallel microprocessors exploit Chip Multiprocessing along with Single Instruction, Mu...
[[abstract]]A systematic procedure for designing pipelined data-parallel algorithms that are suitabl...
This paper describes an implementation technique for integrating nested data parallelism into an obj...
Development of parallel software is a very complex task. Many details, such as domain type, partitio...
Data-parallel languages, such as H scIGH P scERFORMANCE F scORTRAN or F scORTRAN D, provide a machin...
Data flow techniques have been around since the early ’70s when they were used in compilers for sequ...
The Sparc T3-4 server provides up to 512 concurrent hardware threads, a degree of concurrency that i...
Data flow techniques have been around since the early ’70s when they were used in compilers for seq...
In previous work, we have proposed a multithreaded execution model for describing nested data-parall...
The Sparc T3-4 server provides up to 512 concurrent hardware threads, a degree of concurrency that i...
Increased programmability for concurrent applications in distributed systems requires automatic supp...
Abstract: "Data-parallel programming languages have many desirable features, such as single-thread s...
This paper describes methods to adapt existing optimizing compilers for sequential languages to prod...