It has been already verified that hardware-supported fine-grain synchronization provides a significant performance improvement over coarse-grained synchronization mechanisms, such as barriers. Support for fine-grain synchronization on individual data items becomes notably important in order to implement thread-level parallelism more efficiently. One of the major goals of this project is to propose a new efficient way to support fine-grain synchronization mechanisms in multiprocessors. This novel idea is based on the efficient combination of fine-grain synchronization with cache coherence and instruction level parallelism. Both snoopy and directory-based cache coherence protocols have been studied. The work includes the definition of the com...
Emerging multiprocessor architectures such as chip multiprocessors, embedded architectures, and mas...
Multi-core chip architectures are becoming mainstream, permitting increasing on-chip paral-lelism th...
Due to VLSI lithography problems and the limitation of additional architectural enhancements uniproc...
The quest to improve performance forces designers to explore finer-grained multiprocessor machines. ...
Although improved device technology has increased the performance of computer systems, fundamental h...
As the multiprocessors scale beyond the limits of a few tens of processors, we must look beyond the ...
Cache coherence is one of the main challenges to tackle when designing a shared-memory multiprocesso...
In large scale machines, thousands of processor cycles, in other words, missed opportunities to issu...
A wide variety of computer architectures have been proposed to exploit parallelism at different gran...
This paper investigates the performance of synchronization algorithms on ccNUMA multiprocessors, fro...
. Data used by parallel programs can be divided into classes, based on how threads access it. For di...
This paper proposes a set of efficient primitives for process synchronization in multiprocessors. T...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
AbstreetThis paper proposes a set of efficient primitives for process synchronization in muitiproces...
In a shared-memory multiprocessor with private caches, cached copies of a data item must be kept con...
Emerging multiprocessor architectures such as chip multiprocessors, embedded architectures, and mas...
Multi-core chip architectures are becoming mainstream, permitting increasing on-chip paral-lelism th...
Due to VLSI lithography problems and the limitation of additional architectural enhancements uniproc...
The quest to improve performance forces designers to explore finer-grained multiprocessor machines. ...
Although improved device technology has increased the performance of computer systems, fundamental h...
As the multiprocessors scale beyond the limits of a few tens of processors, we must look beyond the ...
Cache coherence is one of the main challenges to tackle when designing a shared-memory multiprocesso...
In large scale machines, thousands of processor cycles, in other words, missed opportunities to issu...
A wide variety of computer architectures have been proposed to exploit parallelism at different gran...
This paper investigates the performance of synchronization algorithms on ccNUMA multiprocessors, fro...
. Data used by parallel programs can be divided into classes, based on how threads access it. For di...
This paper proposes a set of efficient primitives for process synchronization in multiprocessors. T...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
AbstreetThis paper proposes a set of efficient primitives for process synchronization in muitiproces...
In a shared-memory multiprocessor with private caches, cached copies of a data item must be kept con...
Emerging multiprocessor architectures such as chip multiprocessors, embedded architectures, and mas...
Multi-core chip architectures are becoming mainstream, permitting increasing on-chip paral-lelism th...
Due to VLSI lithography problems and the limitation of additional architectural enhancements uniproc...