We present a new technique for the parallel simulation of cache coherent shared memory multiprocessors. Our technique is based on the fact that the functional correctness of the simulation can be decoupled from its timing correctness. Thus in our simulations we can exploit as much parallelism as is available in the application without being constrained by conservative scheduling mechanisms that might limit the available parallelism in order to guarantee the timing correctness of the simulation. Further, application specific details (which can be gleaned from the compiler) such as data layout in the caches of the target architecture can be exploited to reduce the overhead of the simulation. The simulation correctness is guaranteed...
Abstract As the gap between processor and memory speeds increases, cache performance becomes more cr...
In this research we built a SystemC Level-1 data cache system in a distributed shared memory archite...
This paper examines the cost/performance of simulating a hypothetical target parallel computer using...
APPROXIMATE TIME-PARALLEL CACHE SIMULATION In time-parallel simulation, the simulation time axis is ...
[[abstract]]We propose efficient stack simulation algorithms for shared memory multiprocessor (MP) c...
Fast computer simulation is an essential tool in the design of large parallel computers. Our Fast Ac...
Simulation is an important means of evaluating new microarchitectures. With the invention of multi-c...
To increase performance, modern processors employ complex techniques such as out-of-order pipelines ...
An application’s cache miss rate is used in timing analysis, system performance prediction and ...
As multiprocessor systems-on-chip become a reality, perfor-mance modeling becomes a challenge. To qu...
This paper explores statistical simulation as a fast simulation technique for driving chip multiproc...
Journal ArticleA variation of the Time Warp parallel discrete event simulation mechanism is presente...
Simulation has emerged as the primary means for evaluating the design of multiprocessor systems. Sim...
In optimistically synchronized parallel simulators logical processes execute events greedily and rec...
This paper describes an object oriented simulator model for parallel computer systems that is design...
Abstract As the gap between processor and memory speeds increases, cache performance becomes more cr...
In this research we built a SystemC Level-1 data cache system in a distributed shared memory archite...
This paper examines the cost/performance of simulating a hypothetical target parallel computer using...
APPROXIMATE TIME-PARALLEL CACHE SIMULATION In time-parallel simulation, the simulation time axis is ...
[[abstract]]We propose efficient stack simulation algorithms for shared memory multiprocessor (MP) c...
Fast computer simulation is an essential tool in the design of large parallel computers. Our Fast Ac...
Simulation is an important means of evaluating new microarchitectures. With the invention of multi-c...
To increase performance, modern processors employ complex techniques such as out-of-order pipelines ...
An application’s cache miss rate is used in timing analysis, system performance prediction and ...
As multiprocessor systems-on-chip become a reality, perfor-mance modeling becomes a challenge. To qu...
This paper explores statistical simulation as a fast simulation technique for driving chip multiproc...
Journal ArticleA variation of the Time Warp parallel discrete event simulation mechanism is presente...
Simulation has emerged as the primary means for evaluating the design of multiprocessor systems. Sim...
In optimistically synchronized parallel simulators logical processes execute events greedily and rec...
This paper describes an object oriented simulator model for parallel computer systems that is design...
Abstract As the gap between processor and memory speeds increases, cache performance becomes more cr...
In this research we built a SystemC Level-1 data cache system in a distributed shared memory archite...
This paper examines the cost/performance of simulating a hypothetical target parallel computer using...