Plentiful research has addressed low-complexity software-based shared-memory systems since the idea was first introduced more than two decades ago. However, software-coherent systems have not been very successful in the commercial marketplace. We believe there are two main reasons for this: lack of performance and/or lack of binary compatibility. This thesis studies multiple aspects of how to design future binary-compatible high-performance scalable shared-memory servers while keeping the hardware complexity at a minimum. It starts with a software-based distributed shared-memory system relying on no specific hardware support and gradually moves towards architectures with simple hardware support. The evaluation is made in a modern chip-multi...
The last decade has produced enormous improvements in processor speeds without a corresponding impro...
Shared memory provides an attractive and intuitive programming model that makes good use of programm...
Although improved device technology has increased the performance of computer systems, fundamental h...
Shared-memory architectures represent a class of parallel computer systems commonly used in the comm...
We argue that OS-provided data coherence on non-cache-coherent NUMA multiprocessors (machines with a...
Shared memory is widely regarded as a more intuitive model than message passing for the development ...
The advances in semiconductor technology have set the shared memory server trend towards processors ...
This work describes a cache architecture and memory model for 1000+ core microprocessors. Our appro...
Symmetric multiprocessors (SMPs) connected with low-latency networks provide attractive building blo...
Cache coherence is one of the main challenges to tackle when designing a shared-memory multiprocesso...
In programming high performance applications, shared address-space platforms are preferable for fine...
In a shared-memory multiprocessor with private caches, cached copies of a data item must be kept con...
We present design details and some initial performance results of a novel scalable shared memory mul...
Scalable shared-memory multiprocessors provide a flexible programming model with good performance sc...