The end of Dennard scaling has brought energy savings to the forefront of processor design. When coupled with massive datasets, traditional memory architectures are becoming a source of excess energy expenditure. In this thesis, we propose to conserve energy through the co-location of code and data, which limits data movement on and off the chip. To that end, we design a data prefetcher called Tempo that reduces useless prefetches by 18% and increases timeliness by 43%. Tempo boosts the performance of the state-of-the-art Spatial Memory Streaming prefetcher by up to 20% (2.54% on average). In contrast to data prefetching, we also describe an execution paradigm called Anti-Fetching, which dynamically moves code off-chip to a Near-Data Processor...
Superscalar processors contain large, complex structures to hold data and instructions as they wait ...
Energy efficiency has become one of the most important factors in the development of computer system...
Energy efficiency is becoming a major constraint in processor designs. Every component of the proces...
Prefetching has emerged as one of the most successful techniques to bridge the gap between modern pr...
There has been intensive research on data prefetching focusing on performance improvement; however, ...
Extensive research has been done in prefetching techniques that hide memory latency in microprocesso...
Memory accesses in modern processors are both far slower and vastly more energy-expensive than the a...
As data prefetching is used in embedded processors, it is crucial to reduce the wasted energy for im...
As process technology shrinks, the transistor count on CPUs has increased. The breakdown of Dennard ...
An energy-efficient architecture should jointly optimize energy consumption and throughput, as captu...
The line size/performance trade-offs in off-chip second-level caches in light of energy-efficiency a...
Processor performance has increased far faster than memories have been able to keep up with, forcing...
The number of battery-powered devices is growing significantly and these devices require energy-effi...
Memory subsystems with larger capacity and deeper hierarchies have been designed to achieve the maximum ...