Recent OS research suggests that there are performance benefits to running OS services on different cores than the user applications that rely on them. We quantitatively evaluate this claim in terms of one of the most significant architectural constraints: memory performance. To this end, we have created CachEMU, an open-source memory trace generator and cache simulator built as an extension to QEMU for working with system traces. Using CachEMU, we determined that for five common Linux test workloads, it was best to run the OS close, but not too close: on the same package, but not on the same core.
The next decade will afford us computer chips with 1,000 - 10,000 cores on a single piece of silicon...
Personal computing hardware is becoming ever more complex with more cores being added. It is moving ...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Large-scale chip multiprocessors will likely be heterogeneous. It has been suggested ...
We identify the operating system as one area where a novel architecture could signif...
With the explosion of chip transistor counts, the semiconductor industry has struggled w...
Multiprocessor application performance can be limited by the operating system when the application u...
Application performance on modern processors has become increasingly dictated by the use of on-chip ...
Our thesis is that operating systems should manage the on-chip shared caches of multicore processors...
Computer systems are rapidly changing. Over the next few years, we will see wide-scale deployment of...
Cache partitioning and sharing is critical to the effective utilization of multicore processors. How...
Cache memory is one of the most important components of a computer system. The cache allows quickly...
We argue that OS-provided data coherence on non-cache-coherent NUMA multiprocessors (machines with a...
The number of cores which fit on a single chip is growing at an exponential rate while off-chip main...
Recent many-core processors such as Intel’s Xeon Phi and GPGPUs specialize in running highly scalabl...