Following an initial mapping of a problem onto a multiprocessor machine or computer network, system performance often deteriorates with time. In order to maintain high performance, it may be necessary to remap the problem. The decision to remap must take into account measurements of performance deterioration, the cost of remapping, and the estimated benefits achieved by remapping. We examine the tradeoff between the costs and the benefits of remapping two qualitatively different kinds of problems. One problem assumes that performance deteriorates gradually, the other assumes that performance deteriorates suddenly. We consider a variety of policies for governing when to remap. In order to evaluate these policies, statistical models of proble...
This paper describes computer-based heuristic models for Reliability and maintainability (R&M) alloc...
Testing large-scale systems is expensive in terms of both time and money. Running simulations early ...
AbstractImperfect coverage and nonnegligible reconfiguration delay are known to have a deleterious e...
A large class of computational problems is characterized by frequent synchronization, and computatio...
The effectiveness of any given mapping of workload to processors in a parallel system is dependent o...
A large class of computations are characterized by a sequence of phases, with phase changes occurrin...
For bulk synchronous computations that have non-deterministic behaviors, dynamic remapping is an eff...
For bulk synchronous computations that have nondeterministic behaviors, dynamic remapping is an effe...
The use of dynamic reconfiguration has been proposed to tolerate faults in large-scale partitionable...
A new method to determin an optimal retry policy and for use in retry of fault characterization is p...
A central problem to parallel processing is the determination of an effective partitioning of worklo...
Modern day hardware platforms are parallel and diverse, ranging from mobiles to data centers. Mains...
A phase-by-phase modelling technique is introduced to evaluate a fault tolerant system's ability to ...
dynamic programming using function approximators Preface Control systems are making a tremendous imp...
A dynamic programming model with time-domain simulations of contingencies is created to find the lea...
This paper describes computer-based heuristic models for Reliability and maintainability (R&M) alloc...
Testing large-scale systems is expensive in terms of both time and money. Running simulations early ...
AbstractImperfect coverage and nonnegligible reconfiguration delay are known to have a deleterious e...
A large class of computational problems is characterized by frequent synchronization, and computatio...
The effectiveness of any given mapping of workload to processors in a parallel system is dependent o...
A large class of computations are characterized by a sequence of phases, with phase changes occurrin...
For bulk synchronous computations that have non-deterministic behaviors, dynamic remapping is an eff...
For bulk synchronous computations that have nondeterministic behaviors, dynamic remapping is an effe...
The use of dynamic reconfiguration has been proposed to tolerate faults in large-scale partitionable...
A new method to determin an optimal retry policy and for use in retry of fault characterization is p...
A central problem to parallel processing is the determination of an effective partitioning of worklo...
Modern day hardware platforms are parallel and diverse, ranging from mobiles to data centers. Mains...
A phase-by-phase modelling technique is introduced to evaluate a fault tolerant system's ability to ...
dynamic programming using function approximators Preface Control systems are making a tremendous imp...
A dynamic programming model with time-domain simulations of contingencies is created to find the lea...
This paper describes computer-based heuristic models for Reliability and maintainability (R&M) alloc...
Testing large-scale systems is expensive in terms of both time and money. Running simulations early ...
AbstractImperfect coverage and nonnegligible reconfiguration delay are known to have a deleterious e...