It is essential to accurately estimate the working set size (WSS) of an application for various optimizations such as to partition cache among virtual machines or reduce leakage power dissipated in an over-allocated cache by switching it OFF. However, the state-of-the-art heuristics such as average memory access latency (AMAL) or cache miss ratio (CMR) are poorly correlated to the WSS of an application due to 1) over-sized caches and 2) their dispersed nature. Past studies focus on estimating WSS of an application executing on a uniprocessor platform. Estimating the same for a chip multiprocessor (CMP) with a large dispersed cache is challenging due to the presence of concurrently executing threads/processes. Hence, we propose a scalable, h...
This paper addresses feedback-directed restructuring techniques tuned to Non Uniform Cache Architect...
This data set contains the results presented in the paper "Custom Multi-Cache Architectures for Heap...
With the advent of increasingly complex hardware in real-time embedded systems (processors with perf...
The number of processor cores and on-chip cache size has been increasing on chip multiprocessors (CM...
The number of processor cores and on-chip cache size has been increasing on chip multiprocessors (CM...
With the rapid increase of data set size of cloud and big data applications, conventional regular 4K...
Abstract—The ongoing move to chip multiprocessors (CMPs) permits greater sharing of last-level cache...
Modern chip-level multiprocessors (CMPs) contain multiple processor cores sharing a common last-leve...
The increasing levels of transistor density have enabled integration of an increasing number of core...
Embedded system software is highly constrained from performance, memory footprint, energy consumptio...
Multi-core chips have been increasingly adopted by the microprocessor industry. For real-time system...
Embedded systems are getting popular in today’s world. They are usually small and thus have a limite...
This dissertation presents several models for performance, power, and thermal estimations in high-pe...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
In a multicore system, effective management of shared last level cache (LLC), such as hardware/softw...
This paper addresses feedback-directed restructuring techniques tuned to Non Uniform Cache Architect...
This data set contains the results presented in the paper "Custom Multi-Cache Architectures for Heap...
With the advent of increasingly complex hardware in real-time embedded systems (processors with perf...
The number of processor cores and on-chip cache size has been increasing on chip multiprocessors (CM...
The number of processor cores and on-chip cache size has been increasing on chip multiprocessors (CM...
With the rapid increase of data set size of cloud and big data applications, conventional regular 4K...
Abstract—The ongoing move to chip multiprocessors (CMPs) permits greater sharing of last-level cache...
Modern chip-level multiprocessors (CMPs) contain multiple processor cores sharing a common last-leve...
The increasing levels of transistor density have enabled integration of an increasing number of core...
Embedded system software is highly constrained from performance, memory footprint, energy consumptio...
Multi-core chips have been increasingly adopted by the microprocessor industry. For real-time system...
Embedded systems are getting popular in today’s world. They are usually small and thus have a limite...
This dissertation presents several models for performance, power, and thermal estimations in high-pe...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
In a multicore system, effective management of shared last level cache (LLC), such as hardware/softw...
This paper addresses feedback-directed restructuring techniques tuned to Non Uniform Cache Architect...
This data set contains the results presented in the paper "Custom Multi-Cache Architectures for Heap...
With the advent of increasingly complex hardware in real-time embedded systems (processors with perf...