There are two competing models for the on-chip memory in Chip Multiprocessor (CMP) systems: hardware-managed coherent caches and software-managed streaming memory. This paper performs a direct comparison of the two models under the same set of assumptions about technology, area, and computational capabilities. The goal is to quantify how and when they differ in terms of perfor-mance, energy consumption, bandwidth requirements, and latency tolerance for general-purpose CMPs. We demonstrate that for data-parallel applications on systems with up to 16 cores, the cache-based and streaming models perform and scale equally well. For certain applications with little data reuse, streaming scales better due to better bandwidth use and macroscopic so...
This dissertation presents several models for performance, power, and thermal estimations in high-pe...
?Signatures are on le in the Graduate School. iii Chip multiprocessors (CMPs) are becoming a popular...
The integration of an increasing amount of on-chip hardware in Chip-Multiprocessors (CMPs) poses a c...
In the world of complex SoCs for consumer applica-tions, multiprocessor architectures usually deploy...
Abstract—Efficient utilizing on-chip storage space on Chip-Multiprocessors (CMPs) has become an impo...
Continued advances in circuit integration technology has ushered in the era of chip multiprocessor (...
Abstract — Performance tradeoffs between fast data access by local data replication and cache capaci...
We compare the performance of three major programming models— a load-store cache-coherent shared add...
Shared memory is a common interprocessor communi-cation paradigm for single-chip multi-processor pla...
This dissertation aims at improving the off-chip bandwidth utilization and energy efficiency in chip...
Abstract—By integrating multiple cores in a single chip, Chip Multiprocessors (CMP) provide an attra...
Efficient use of the memory hierarchy is critical for achieving high performance in a multiprocessor...
New generation System-on-Chips will be extremely complex devices, composed from complex subsystems, ...
The continued decrease in transistor size and the increasing delay of wires relative to transistor s...
The memory consistency model of a shared-memory multiprocessor determines the extent to which memory...
This dissertation presents several models for performance, power, and thermal estimations in high-pe...
?Signatures are on le in the Graduate School. iii Chip multiprocessors (CMPs) are becoming a popular...
The integration of an increasing amount of on-chip hardware in Chip-Multiprocessors (CMPs) poses a c...
In the world of complex SoCs for consumer applica-tions, multiprocessor architectures usually deploy...
Abstract—Efficient utilizing on-chip storage space on Chip-Multiprocessors (CMPs) has become an impo...
Continued advances in circuit integration technology has ushered in the era of chip multiprocessor (...
Abstract — Performance tradeoffs between fast data access by local data replication and cache capaci...
We compare the performance of three major programming models— a load-store cache-coherent shared add...
Shared memory is a common interprocessor communi-cation paradigm for single-chip multi-processor pla...
This dissertation aims at improving the off-chip bandwidth utilization and energy efficiency in chip...
Abstract—By integrating multiple cores in a single chip, Chip Multiprocessors (CMP) provide an attra...
Efficient use of the memory hierarchy is critical for achieving high performance in a multiprocessor...
New generation System-on-Chips will be extremely complex devices, composed from complex subsystems, ...
The continued decrease in transistor size and the increasing delay of wires relative to transistor s...
The memory consistency model of a shared-memory multiprocessor determines the extent to which memory...
This dissertation presents several models for performance, power, and thermal estimations in high-pe...
?Signatures are on le in the Graduate School. iii Chip multiprocessors (CMPs) are becoming a popular...
The integration of an increasing amount of on-chip hardware in Chip-Multiprocessors (CMPs) poses a c...