For bulk synchronous computations that have nondeterministic behaviors, dynamic remapping is an effective approach to ensure parallel efficiency. There are two basic issues in remapping: when and how to remap. This paper presents a formal treatment of the first issue for dynamic computations with a priori known statistical behaviors. We have formulated the problem as two complement sequential stochastic optimization, with an objective of finding optimal remapping frequencies for a given tolerance of load imbalance on multiprogrammed distributed systems. We have developed analytical approaches to precisely characterize the transient statistical behaviors of the workload process and derived optimal remapping frequencies for various random wor...
Traditional network models have played an important role over the last four decades in providing ins...
In a distributed heterogeneous computing system, the resources have different capabilities and tasks...
This paper examines the performance of synchronous checkpointing in a distributed computing environm...
For bulk synchronous computations that have non-deterministic behaviors, dynamic remapping is an eff...
A large class of computational problems is characterized by frequent synchronization, and computatio...
The effectiveness of any given mapping of workload to processors in a parallel system is dependent o...
A large class of computations are characterized by a sequence of phases, with phase changes occurrin...
Following an initial mapping of a problem onto a multiprocessor machine or computer network, system ...
Abstract—Many parallel applications require periodic redistribution of workloads and associated data...
A fundamental issue affecting the performance of a parallel application running on message-passing p...
Abstract — In this paper we give a theoretical model for determining the synchronization frequency t...
This thesis presents and analyzes scalable algorithms for dynamic load balancing and mapping in dist...
This paper describes the performance of localitybased mapping and remapping partitioners for unstruc...
The performance of anytime algorithms can be improved by simultaneously solving several instances of...
The overall efficiency of parallel algorithms is most decisively effected by the strategy applied fo...
Traditional network models have played an important role over the last four decades in providing ins...
In a distributed heterogeneous computing system, the resources have different capabilities and tasks...
This paper examines the performance of synchronous checkpointing in a distributed computing environm...
For bulk synchronous computations that have non-deterministic behaviors, dynamic remapping is an eff...
A large class of computational problems is characterized by frequent synchronization, and computatio...
The effectiveness of any given mapping of workload to processors in a parallel system is dependent o...
A large class of computations are characterized by a sequence of phases, with phase changes occurrin...
Following an initial mapping of a problem onto a multiprocessor machine or computer network, system ...
Abstract—Many parallel applications require periodic redistribution of workloads and associated data...
A fundamental issue affecting the performance of a parallel application running on message-passing p...
Abstract — In this paper we give a theoretical model for determining the synchronization frequency t...
This thesis presents and analyzes scalable algorithms for dynamic load balancing and mapping in dist...
This paper describes the performance of localitybased mapping and remapping partitioners for unstruc...
The performance of anytime algorithms can be improved by simultaneously solving several instances of...
The overall efficiency of parallel algorithms is most decisively effected by the strategy applied fo...
Traditional network models have played an important role over the last four decades in providing ins...
In a distributed heterogeneous computing system, the resources have different capabilities and tasks...
This paper examines the performance of synchronous checkpointing in a distributed computing environm...