This paper explores statistical simulation as a fast simulation technique for driving chip multiprocessor (CMP) design space exploration. The idea of statistical simulation is to measure a number of important program execution characteristics, generate a synthetic trace, and simulate that synthetic trace. The important benefit is that a synthetic trace is very small compared to real program traces. This paper advances statistical simulation by modeling shared resources, such as shared caches and off-chip bandwidth. This is done (i) by collecting cache set access probabilities and per-set LRU stack depth profiles, and (ii) by modeling a program's time-varying execution behavior in the synthetic trace. The key benefit is that the statistical...
We present a new technique for the parallel simulation of cache coherent shared memory multiprocess...
Architecture simulation tools are extremely useful not only to predict the performance of future sys...
Application performance on computer processors depends on a number of complex architectural and micr...
This paper explores statistical simulation as a fast simulation technique for driving chip multiproc...
Developing fast chip multiprocessor simulation techniques is a challenging problem. Solving this pro...
According to Moore’s law the number of transistors on a single chip doubles every 18 months. To resp...
Detailed, cycle-accurate processor simulation is an inte-gral component of the design and study of c...
Composing a representative multi-program multi-core workload is non-trivial. A multi-core processor ...
To increase performance, modern processors employ complex techniques such as out-of-order pipelines ...
Statistical simulation is known to be a useful technique to efficiently cull huge design spaces in l...
Abstract — Performance tradeoffs between fast data access by local data replication and cache capaci...
Designing a new microprocessor is extremely time-consuming. One of the contributing reasons is that ...
Writing well-performing parallel programs is challenging in the multi-core processor era. In additio...
As multiprocessor systems-on-chip become a reality, perfor-mance modeling becomes a challenge. To qu...
Microarchitectural simulation of multithreaded architectures with shared resources, such as simultan...
We present a new technique for the parallel simulation of cache coherent shared memory multiprocess...
Architecture simulation tools are extremely useful not only to predict the performance of future sys...
Application performance on computer processors depends on a number of complex architectural and micr...
This paper explores statistical simulation as a fast simulation technique for driving chip multiproc...
Developing fast chip multiprocessor simulation techniques is a challenging problem. Solving this pro...
According to Moore’s law the number of transistors on a single chip doubles every 18 months. To resp...
Detailed, cycle-accurate processor simulation is an inte-gral component of the design and study of c...
Composing a representative multi-program multi-core workload is non-trivial. A multi-core processor ...
To increase performance, modern processors employ complex techniques such as out-of-order pipelines ...
Statistical simulation is known to be a useful technique to efficiently cull huge design spaces in l...
Abstract — Performance tradeoffs between fast data access by local data replication and cache capaci...
Designing a new microprocessor is extremely time-consuming. One of the contributing reasons is that ...
Writing well-performing parallel programs is challenging in the multi-core processor era. In additio...
As multiprocessor systems-on-chip become a reality, perfor-mance modeling becomes a challenge. To qu...
Microarchitectural simulation of multithreaded architectures with shared resources, such as simultan...
We present a new technique for the parallel simulation of cache coherent shared memory multiprocess...
Architecture simulation tools are extremely useful not only to predict the performance of future sys...
Application performance on computer processors depends on a number of complex architectural and micr...