This paper proposes an analytical model to predict Memory-Level Parallelism (MLP) in a superscalar processor. We profile the workload once and measure a set of distributions to characterize the workload's inherent memory behavior. We subsequently generate a virtual instruction stream, over which we then process an abstract MLP model to predict MLP for a particular micro-architecture with a given ROB size, LLC size, MSHR size and stride-based prefetcher. Experimental evaluation reports an improvement in modeling error from 16.9 percent for previous work to 3.6 percent on average for the proposed model
Analytical performance models yield valuable architectural insight without incurring the excessive r...
To continuously comply with Moore's Law, modern parallel machines become increasingly complex. Effec...
The ever-increasing computational power of contemporary microprocessors reduces the execution time s...
A mechanistic model for out-of-order superscalar processors is developed and then applied to the stu...
A proposed performance model for superscalar processors consists of 1) a component that models the r...
Mechanistic processor performance modeling builds an analytical model from understanding the underly...
As the number of transistors integrated on a chip continues to increase, a growing challenge is accu...
Superscalar in-order processors form an interesting alternative to out-of-order processors because o...
Optimizing processors for (a) specific application(s) can substantially improve energy-efficiency. W...
The performance of memory-bound commercial applications such as databases is limited by increasing m...
Superscalar in-order processors form an interesting alternative to out-of-order processors because o...
The ever-increasing computational power of contemporary microprocessors reduces the execution time s...
Parametric micro-level (PM) performance models are introduced to address the important issue of how ...
To increase performance, modern processors employ complex techniques such as out-of-order pipelines ...
Application performance on computer processors depends on a number of complex architectural and micr...
Analytical performance models yield valuable architectural insight without incurring the excessive r...
To continuously comply with Moore's Law, modern parallel machines become increasingly complex. Effec...
The ever-increasing computational power of contemporary microprocessors reduces the execution time s...
A mechanistic model for out-of-order superscalar processors is developed and then applied to the stu...
A proposed performance model for superscalar processors consists of 1) a component that models the r...
Mechanistic processor performance modeling builds an analytical model from understanding the underly...
As the number of transistors integrated on a chip continues to increase, a growing challenge is accu...
Superscalar in-order processors form an interesting alternative to out-of-order processors because o...
Optimizing processors for (a) specific application(s) can substantially improve energy-efficiency. W...
The performance of memory-bound commercial applications such as databases is limited by increasing m...
Superscalar in-order processors form an interesting alternative to out-of-order processors because o...
The ever-increasing computational power of contemporary microprocessors reduces the execution time s...
Parametric micro-level (PM) performance models are introduced to address the important issue of how ...
To increase performance, modern processors employ complex techniques such as out-of-order pipelines ...
Application performance on computer processors depends on a number of complex architectural and micr...
Analytical performance models yield valuable architectural insight without incurring the excessive r...
To continuously comply with Moore's Law, modern parallel machines become increasingly complex. Effec...
The ever-increasing computational power of contemporary microprocessors reduces the execution time s...