In writing parallel programs, programmers expose parallelism and optimize it to meet a particular performance goal on a single platform under an assumed set of workload characteristics. In the field, changing workload characteristics, new parallel platforms, and deployments with different performance goals make the programmer's development-time choices suboptimal. To address this problem, this paper presents the Degree of Parallelism Executive (DoPE), an API and run-time system that separates the concern of exposing parallelism from that of optimizing it. Using the DoPE API, the application developer expresses parallelism options. During program execution, DoPE's run-time system uses this information to dynamically optimize the pa...
Today multicore technology is the standard processor technology for desktop and laptop computers and...
The multicore era has increased the need for highly parallel software. Since automatic parallelizati...
It is today's general wisdom that the productive use of parallel architectures depends cruciall...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
Abstract — A well organized parallel application can accomplish better performance over sequential e...
Recent desktop and mobile processors often integrate CPU and GPU onto the same die. The limited memo...
Many problems currently require more processor throughput than can be achieved with current single-p...
peer-reviewedIn recent years there has been a shift in microprocessor manufacture from building sing...
There is an increasing need for a framework that supports research on portable high-performance para...
We have been pursuing a research program aimed at enhancing productivity and performance in parallel...
Parallel software development must face the fact that different architectures require different impl...
[[abstract]]Performing run-time parallelization on general networks of workstations (NOWs) without s...
1. Introduction The SCOOPP (SCalable Object Oriented Parallel Programming) system described in [Sob...
Advances in hardware technologies combined with decreased costs have started a trend towards massive...
Coarse-grained task parallelism exists in sequential code and can be leveraged to boost the use of ...
Today multicore technology is the standard processor technology for desktop and laptop computers and...
The multicore era has increased the need for highly parallel software. Since automatic parallelizati...
It is today's general wisdom that the productive use of parallel architectures depends cruciall...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
Abstract — A well organized parallel application can accomplish better performance over sequential e...
Recent desktop and mobile processors often integrate CPU and GPU onto the same die. The limited memo...
Many problems currently require more processor throughput than can be achieved with current single-p...
peer-reviewedIn recent years there has been a shift in microprocessor manufacture from building sing...
There is an increasing need for a framework that supports research on portable high-performance para...
We have been pursuing a research program aimed at enhancing productivity and performance in parallel...
Parallel software development must face the fact that different architectures require different impl...
[[abstract]]Performing run-time parallelization on general networks of workstations (NOWs) without s...
1. Introduction The SCOOPP (SCalable Object Oriented Parallel Programming) system described in [Sob...
Advances in hardware technologies combined with decreased costs have started a trend towards massive...
Coarse-grained task parallelism exists in sequential code and can be leveraged to boost the use of ...
Today multicore technology is the standard processor technology for desktop and laptop computers and...
The multicore era has increased the need for highly parallel software. Since automatic parallelizati...
It is today's general wisdom that the productive use of parallel architectures depends cruciall...