The multicore era has initiated a move to ubiquitous parallelization of software. In the process, cores have scaled out but the memory subsystem resources have not kept up. Memory subsystem contention within and between applications makes it challenging to extract performance scaling that matches the increase in the number of cores. This dissertation explores the diagnosis of memory subsystem contention, identifies associated performance and energy efficiency opportunities, and suggests techniques and optimizations to both precisely measure and reduce the contention. The dissertation begins by exploring contention within a single and between multiple, large-scale, distributed scientific applications and moves to exploring the impact of memo...
Our thesis is that operating systems should manage the on-chip shared caches of multicore processors...
Modern microprocessors integrate a growing number of compo-nents on a single chip, such as processor...
Our thesis is that operating systems should manage the on-chip shared caches of multicore processors...
Parallelism is ubiquitous in modern computer architectures. Heterogeneity of CPU cores and deep memo...
With the breakdown of Dennard Scaling and Moore's law, power consumption appears to be a primary cha...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
Multi-core computers are infamous for being hard to use in time-critical systems due to execution-ti...
Multi-core computers are infamous for being hard to use in time-critical systems due to execution-ti...
Main memory in clusters may dominate total system power. The resulting energy consumption increases ...
Multi-core processors dominate current mainframe, server, and high performance computing (HPC) syste...
Multicore processors have become ubiquitous in today's computing platforms, extending from smartphon...
<p>The continued growth of the computational capability of throughput processors has made throughput...
Minimizing power, increasing performance, and delivering effective memory bandwidth are today's prim...
Shared-memory multiprocessors built from commodity microprocessors are being increasingly used to pr...
Our thesis is that operating systems should manage the on-chip shared caches of multicore processors...
Modern microprocessors integrate a growing number of compo-nents on a single chip, such as processor...
Our thesis is that operating systems should manage the on-chip shared caches of multicore processors...
Parallelism is ubiquitous in modern computer architectures. Heterogeneity of CPU cores and deep memo...
With the breakdown of Dennard Scaling and Moore's law, power consumption appears to be a primary cha...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
Multi-core computers are infamous for being hard to use in time-critical systems due to execution-ti...
Multi-core computers are infamous for being hard to use in time-critical systems due to execution-ti...
Main memory in clusters may dominate total system power. The resulting energy consumption increases ...
Multi-core processors dominate current mainframe, server, and high performance computing (HPC) syste...
Multicore processors have become ubiquitous in today's computing platforms, extending from smartphon...
<p>The continued growth of the computational capability of throughput processors has made throughput...
Minimizing power, increasing performance, and delivering effective memory bandwidth are today's prim...
Shared-memory multiprocessors built from commodity microprocessors are being increasingly used to pr...
Our thesis is that operating systems should manage the on-chip shared caches of multicore processors...
Modern microprocessors integrate a growing number of compo-nents on a single chip, such as processor...
Our thesis is that operating systems should manage the on-chip shared caches of multicore processors...