Multicore processors can improve performance by decreasing the execution latency of parallel programs, or by increasing throughput, i.e., the quantity of work done per unit of time when executing independent tasks. Throughput is not necessarily proportional to the number of cores and can be impacted significantly by resource sharing in several parts of the microarchitecture. Quantifying the impact of resource sharing on throughput requires a throughput metric. A majority of microarchitecture studies use equal-time throughput metrics, such as IPC throughput or weighted speedup, that are based on the implicit assumption that all the jobs execute for a fixed and equal time. We argue that this assumption is not realistic. We propose and charact...
For computer designers, understanding the characteristics of workloads running on current and future...
Assessing the performance of multiprogram workloads running on multithreaded hardware is difficult b...
International audience—Estimating the potential performance of parallel applications on the yet-to-b...
Multicore processors can improve performance by decreasing the execution latency of parallel program...
Measuring throughput is not as straightforward as measuring execution time. This has led to an ongoi...
Running multiple programs on a processor aims at increasing the throughput of that processor. Howeve...
International audienceRunning multiple programs on a processor aims at increasing the throughput of ...
International audienceSeveral different metrics have been proposed for quantifying the throughput of...
Weighted speedup is nowadays the most commonly used multiprogram workload performance metric. Weight...
Abstract—Weighted speedup is nowadays the most commonly used multiprogram workload performance metri...
Advanced many-core CPU chips already have few hundreds of processing cores (e.g. 160 cores in an IBM...
Understanding the behavior of emerging workloads is important for designing next generation micropro...
International audienceMost high-performance processors today are able to execute multiple threads of...
Designers of chip multiprocessors will increasingly be called upon to optimize for a combination of ...
For computer designers, understanding the characteristics of workloads running on current and future...
Assessing the performance of multiprogram workloads running on multithreaded hardware is difficult b...
International audience—Estimating the potential performance of parallel applications on the yet-to-b...
Multicore processors can improve performance by decreasing the execution latency of parallel program...
Measuring throughput is not as straightforward as measuring execution time. This has led to an ongoi...
Running multiple programs on a processor aims at increasing the throughput of that processor. Howeve...
International audienceRunning multiple programs on a processor aims at increasing the throughput of ...
International audienceSeveral different metrics have been proposed for quantifying the throughput of...
Weighted speedup is nowadays the most commonly used multiprogram workload performance metric. Weight...
Abstract—Weighted speedup is nowadays the most commonly used multiprogram workload performance metri...
Advanced many-core CPU chips already have few hundreds of processing cores (e.g. 160 cores in an IBM...
Understanding the behavior of emerging workloads is important for designing next generation micropro...
International audienceMost high-performance processors today are able to execute multiple threads of...
Designers of chip multiprocessors will increasingly be called upon to optimize for a combination of ...
For computer designers, understanding the characteristics of workloads running on current and future...
Assessing the performance of multiprogram workloads running on multithreaded hardware is difficult b...
International audience—Estimating the potential performance of parallel applications on the yet-to-b...