Chip-multiprocessor (CMP) architectures are a promising design alternative to exploit the everincreasing number of transistors that can be put on a die. To deliver high performance on applications that cannot be easily parallelized, CMPs can use additional support for speculatively executing the possibly data-dependent threads of an application. For cross-thread dependences that must be handled dynamically, the threads can be made to synchronize and communicate either at the register level or at the memory level. In the past, it has been unclear whether the higher hardware cost of register-level communication is cost-eective. In this paper, we show that the wide-issue dynamic processors that will soon populate CMPs, make fast commun...
Chip multiprocessors (CMPs) substantially increase capacity pressure on the on-chip memory hierarchy...
Microprocessor industry has converged on chip multiprocessor (CMP) as the architecture of choice to ...
The future of performance scaling lies in massively parallel workloads, but less-parallel applicati...
Abstract Chip-multiprocessor (CMP) architectures are a promising design alternative to exploit the e...
While Chip Multiprocessors (CMP) with Speculative Multithreading (SM) support have been gaining mome...
Exploitation of parallelism has for decades been central to the pursuit of computing performance. Th...
Chip Multiprocessors (CMPs) or multi-core architectures are a new class of processor architectures. ...
With the proliferation of Chip Multiprocessors (CMPs), shared memory multi-threaded programs are exp...
As the increasing of issue width has diminishing returns with superscalar processor, thread parallel...
As technology advances, microprocessors that support multiple threads of execution on a single chip ...
Abstract Chip multi-processors (CMPs) already have widespread com-mercial availability, and technolo...
Due to memory bandwidth limitations, chip multiprocessors (CMP) adopting the convenient shared memor...
This paper presents a novel mechanism for barrier synchronization on chip multi-processors (CMPs). B...
On the road to computer systems able to support the requirements of exascale applications, Chip Mult...
Moving threads is a theoretically interesting approach for mapping the computation of an application...
Chip multiprocessors (CMPs) substantially increase capacity pressure on the on-chip memory hierarchy...
Microprocessor industry has converged on chip multiprocessor (CMP) as the architecture of choice to ...
The future of performance scaling lies in massively parallel workloads, but less-parallel applicati...
Abstract Chip-multiprocessor (CMP) architectures are a promising design alternative to exploit the e...
While Chip Multiprocessors (CMP) with Speculative Multithreading (SM) support have been gaining mome...
Exploitation of parallelism has for decades been central to the pursuit of computing performance. Th...
Chip Multiprocessors (CMPs) or multi-core architectures are a new class of processor architectures. ...
With the proliferation of Chip Multiprocessors (CMPs), shared memory multi-threaded programs are exp...
As the increasing of issue width has diminishing returns with superscalar processor, thread parallel...
As technology advances, microprocessors that support multiple threads of execution on a single chip ...
Abstract Chip multi-processors (CMPs) already have widespread com-mercial availability, and technolo...
Due to memory bandwidth limitations, chip multiprocessors (CMP) adopting the convenient shared memor...
This paper presents a novel mechanism for barrier synchronization on chip multi-processors (CMPs). B...
On the road to computer systems able to support the requirements of exascale applications, Chip Mult...
Moving threads is a theoretically interesting approach for mapping the computation of an application...
Chip multiprocessors (CMPs) substantially increase capacity pressure on the on-chip memory hierarchy...
Microprocessor industry has converged on chip multiprocessor (CMP) as the architecture of choice to ...
The future of performance scaling lies in massively parallel workloads, but less-parallel applicati...