This dissertation investigates prefetching scheme for servers with respect to realistic memory systems. A large body of research work has been done in prefetching, even for server workloads that have sparse locality. Real systems disable prefetching in server settings, suggesting that there is a fundamental disconnect between research and practice. Our theory, a major point of this thesis, is that this disconnect is due to the use of simplistic memory models -- and our experimental results show that, among other things, using simplistic models can over-predict the system performance by up to 65%. Our investigation proceeds as follows: (In)Accuracy of Simplistic Memory Models. We demonstrate the degrees of inaccuracy of models commonly us...
textData Prefetching is a well-known technique to speed up applications wherein hardware prefetchers...
Memory latency has always been a major issue in shared-memory multiprocessors and high-speed systems...
In the last century great progress was achieved in developing processors with extremely high computa...
International audienceIn multi-core systems, an application's prefetcher can interfere with the memo...
this paper, we examine the way in which prefetching can exploit parallelism. Prefetching has been st...
A well known performance bottleneck in computer architecture is the so-called memory wall. This term...
This paper presents new analytical models of the performance be-nefits of multithreading and prefetc...
In recent years, there has been a growing trend towards using multi-core processors in real-time sys...
textModern computer systems spend a substantial fraction of their running time waiting for data from...
This thesis considers two approaches to the design of high-performance computers. In a <I>single pro...
Prefetching, i.e., exploiting the overlap of processor com-putations with data accesses, is one of s...
Chip Multiprocessors (CMP) are an increasingly popular architecture and increasing numbers of vendor...
Software prefetching and locality optimizations are two techniques for overcoming the speed gap bet...
Data prefetching has been widely studied as a technique to hide memory access latency in multiproces...
External Memory models, most notable being the I-O Model [3], capture the effects of memory hierarch...
textData Prefetching is a well-known technique to speed up applications wherein hardware prefetchers...
Memory latency has always been a major issue in shared-memory multiprocessors and high-speed systems...
In the last century great progress was achieved in developing processors with extremely high computa...
International audienceIn multi-core systems, an application's prefetcher can interfere with the memo...
this paper, we examine the way in which prefetching can exploit parallelism. Prefetching has been st...
A well known performance bottleneck in computer architecture is the so-called memory wall. This term...
This paper presents new analytical models of the performance be-nefits of multithreading and prefetc...
In recent years, there has been a growing trend towards using multi-core processors in real-time sys...
textModern computer systems spend a substantial fraction of their running time waiting for data from...
This thesis considers two approaches to the design of high-performance computers. In a <I>single pro...
Prefetching, i.e., exploiting the overlap of processor com-putations with data accesses, is one of s...
Chip Multiprocessors (CMP) are an increasingly popular architecture and increasing numbers of vendor...
Software prefetching and locality optimizations are two techniques for overcoming the speed gap bet...
Data prefetching has been widely studied as a technique to hide memory access latency in multiproces...
External Memory models, most notable being the I-O Model [3], capture the effects of memory hierarch...
textData Prefetching is a well-known technique to speed up applications wherein hardware prefetchers...
Memory latency has always been a major issue in shared-memory multiprocessors and high-speed systems...
In the last century great progress was achieved in developing processors with extremely high computa...