This paper introduces the EC frontend and DSIM simulator. Given a parallel program, they determine its execution time on huge networks of computers. EC extracts task step needs. DSIM predicts completion times rather than simulating each program step. This paper contains analyses of the memory savings and the execution time savings for simulations of one to 2,800 computers running parallel Gaussian elimination and fast Fourier transform. The time savings are 20%(two days) for fifty runs of Gaussian reduction of a 400 \Theta 401 matrix to solve 400 linear equations. Memory needs are reduced 99% (637 MBytes) per simulation run. The memory savings allow simulation of parallel programs running on thousands of processors. These huge network sizes...
We assess gains from parallel computation on Backlight supercomputer. The information transfers are ...
The associative memory (AM) system is a computing device made of hundreds of AM ASICs chips designed...
The limits of sequential processing continue to be overcome with parallel and distributed architectu...
One method to evaluate a distributed shared memory(DSM) system is to analyze its performance for a v...
The limits of sequential processing continue to be overcome with parallel and distributed architectu...
Parallel computer programs are used to speed up the calculation of computationally-demanding scienti...
This paper examines the cost/performance of simulating a hypothetical target parallel computer using...
The simulation of parallel systems is an alternative approach to classical parallel system programmi...
Detailed simulations of large scale message-passing interface parallel applications are extremely ti...
Distributed shared memory systems have become popular as a means of utilizing clusters of com-puters...
PhD ThesisThis thesis develops and evaluates a number of efficient algorithms for performing paralle...
[[abstract]]In recent years, it has gradually become popular to use discrete-event simulation as a t...
This paper presents a technique which attempts to aid the simulationist in the decision as to whethe...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
AbstractIn this paper, the execution time cost of a parallel computation in a shared memory environm...
We assess gains from parallel computation on Backlight supercomputer. The information transfers are ...
The associative memory (AM) system is a computing device made of hundreds of AM ASICs chips designed...
The limits of sequential processing continue to be overcome with parallel and distributed architectu...
One method to evaluate a distributed shared memory(DSM) system is to analyze its performance for a v...
The limits of sequential processing continue to be overcome with parallel and distributed architectu...
Parallel computer programs are used to speed up the calculation of computationally-demanding scienti...
This paper examines the cost/performance of simulating a hypothetical target parallel computer using...
The simulation of parallel systems is an alternative approach to classical parallel system programmi...
Detailed simulations of large scale message-passing interface parallel applications are extremely ti...
Distributed shared memory systems have become popular as a means of utilizing clusters of com-puters...
PhD ThesisThis thesis develops and evaluates a number of efficient algorithms for performing paralle...
[[abstract]]In recent years, it has gradually become popular to use discrete-event simulation as a t...
This paper presents a technique which attempts to aid the simulationist in the decision as to whethe...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
AbstractIn this paper, the execution time cost of a parallel computation in a shared memory environm...
We assess gains from parallel computation on Backlight supercomputer. The information transfers are ...
The associative memory (AM) system is a computing device made of hundreds of AM ASICs chips designed...
The limits of sequential processing continue to be overcome with parallel and distributed architectu...