Operating system process scheduling has been an active area of research for many years. Process scheduling decisions can have a dramatic impact on capacity and conflict misses in on-chip caches, since processes that do not share memory will compete for entries in the cache, while processes that do share memory can effectively pre-fetch blocks or warm the cache in a symbiotic fashion. In this paper we provide a detailed characterization of context switch misses and quantify its impact. We also investigate the potential of intelligent process scheduling that minimizes cache misses across context-switch boundaries. We have developed several greedy heuristics that enable us to perform a trace driven limit study on the benefits of cache friendly...
Nearly all modern computing systems employ caches to hide the memory latency. Modern processors ofte...
This paper proposes an analytical cache model for time-shared systems focusing on fully-associative ...
This paper describes a method to improve the cache locality of sequential programs by scheduling fin...
As a process executes on a CPU, it builds up state in that CPU's cache. In multiprogrammed work...
Computer system performance has been pushed further and further for decades, and hence the complexit...
Hyper-threaded systems show an increase in popularity in modern computers due to the performance imp...
A feature in modern operating systems is the ability to switch between programs so they appear to ru...
As the trend of more cores sharing common resources on a single die and more systems crammed into en...
Cache utilisation is often very poor in multithreaded applications, due to the loss of data access l...
Context switching times are a major source of overhead in medium to fine grain process scheduling. W...
An accurate, tractable, analytic cache model for time-shared systems is presented, which estimates t...
We present a new operating system scheduling algorithm for multicore processors. Our algorithm reduc...
Guaranteeing time-predictable execution in real-time systems involves the management of not only pro...
© 2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Modern embedded CPU systems rely on a growing number of software features, but this growth increases...
Nearly all modern computing systems employ caches to hide the memory latency. Modern processors ofte...
This paper proposes an analytical cache model for time-shared systems focusing on fully-associative ...
This paper describes a method to improve the cache locality of sequential programs by scheduling fin...
As a process executes on a CPU, it builds up state in that CPU's cache. In multiprogrammed work...
Computer system performance has been pushed further and further for decades, and hence the complexit...
Hyper-threaded systems show an increase in popularity in modern computers due to the performance imp...
A feature in modern operating systems is the ability to switch between programs so they appear to ru...
As the trend of more cores sharing common resources on a single die and more systems crammed into en...
Cache utilisation is often very poor in multithreaded applications, due to the loss of data access l...
Context switching times are a major source of overhead in medium to fine grain process scheduling. W...
An accurate, tractable, analytic cache model for time-shared systems is presented, which estimates t...
We present a new operating system scheduling algorithm for multicore processors. Our algorithm reduc...
Guaranteeing time-predictable execution in real-time systems involves the management of not only pro...
© 2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Modern embedded CPU systems rely on a growing number of software features, but this growth increases...
Nearly all modern computing systems employ caches to hide the memory latency. Modern processors ofte...
This paper proposes an analytical cache model for time-shared systems focusing on fully-associative ...
This paper describes a method to improve the cache locality of sequential programs by scheduling fin...