. In this paper we present a new method for achieving a higher cost--efficiency on parallel computers. We insert routines into a program which detect the amount of computational work without using problem-- specific parameters and adapt the number of used CPUs at runtime under given speedup/efficiency constraints. Several user--tunable strategies for selecting the number of processors are presented and compared. The modularity of this approach and its application--independence permit a general use on parallel computers with a scalable degree of parallelism. 1 Introduction Programs on parallel computers usually use all available processors. This is a waste of resources if the load is not evenly distributed or the amount of work is too small...
The goal of parallelizing, or restructuring, compilers is to detect and exploit parallelism in seque...
Advanced many-core CPU chips already have few hundreds of processing cores (e.g. 160 cores in an IBM...
Parallel computers can provide impressive speedups, but unfortunately such speedups are difficult to...
We address the problem of maximizing application speedup through runtime, self-selection of an appro...
Parallel programming involves finding the potential parallelism in an application, choosing an algor...
The multicore era has increased the need for highly parallel software. Since automatic parallelizati...
Load balancing increases the efficient usage of existing resources for parallel and distributed appl...
Given a multiprocessor system and a workload composed of several parallel programs, the processor al...
It is possible to reduce the computation time of data parallel programs by dividing the computation ...
Given the wide scale adoption of multi-cores in main stream computing, parallel programs rarely exec...
Given the wide scale adoption of multi-cores in main stream computing, parallel programs rarely exec...
As moderate-scale multiprocessors become widely used, we foresee an increased demand for effective c...
Today multicore technology is the standard processor technology for desktop and laptop computers and...
In recent years parallel computing has become ubiquitous. Lead by the spread of commodity multicore ...
While parallel programming is needed to solve large-scale scientific applications, it is more diffic...
The goal of parallelizing, or restructuring, compilers is to detect and exploit parallelism in seque...
Advanced many-core CPU chips already have few hundreds of processing cores (e.g. 160 cores in an IBM...
Parallel computers can provide impressive speedups, but unfortunately such speedups are difficult to...
We address the problem of maximizing application speedup through runtime, self-selection of an appro...
Parallel programming involves finding the potential parallelism in an application, choosing an algor...
The multicore era has increased the need for highly parallel software. Since automatic parallelizati...
Load balancing increases the efficient usage of existing resources for parallel and distributed appl...
Given a multiprocessor system and a workload composed of several parallel programs, the processor al...
It is possible to reduce the computation time of data parallel programs by dividing the computation ...
Given the wide scale adoption of multi-cores in main stream computing, parallel programs rarely exec...
Given the wide scale adoption of multi-cores in main stream computing, parallel programs rarely exec...
As moderate-scale multiprocessors become widely used, we foresee an increased demand for effective c...
Today multicore technology is the standard processor technology for desktop and laptop computers and...
In recent years parallel computing has become ubiquitous. Lead by the spread of commodity multicore ...
While parallel programming is needed to solve large-scale scientific applications, it is more diffic...
The goal of parallelizing, or restructuring, compilers is to detect and exploit parallelism in seque...
Advanced many-core CPU chips already have few hundreds of processing cores (e.g. 160 cores in an IBM...
Parallel computers can provide impressive speedups, but unfortunately such speedups are difficult to...