In modern computer systems, memory accesses and power management are the two major performance-limiting factors. Accesses to main memory are very slow compared to operations within a processor chip. Hardware write buffers, caches, out-of-order execution, and prefetch logic are commonly used to reduce the time spent waiting for main memory accesses. Compiler loop interchange and data layout transformations can also help. Unfortunately, large data structures often have access patterns for which none of the standard approaches is useful. Using smaller data structures can significantly improve performance by allowing the data to reside in higher levels of the memory hierarchy. This dissertation proposes using lossy data compression techn...
Achieving high application performance depends on the combination of memory footprint, instruction m...
The roofline model not only provides a powerful tool to relate an application's performance with ...
Processor efficiency can be described with the help of a number of desirable effects or metrics, f...
High performance computing (HPC) demands huge memory bandwidth and computing resources to achieve ma...
Energy and power are the main design constraints for modern high-performance computing systems. Inde...
Beyond the fact of generating machine code, compilers play a critical role in ...
To improve the power consumption of parallel applications at the runtime, modern processors provide ...
Projecting performance of applications and hardware is important to several market segments—hardware...
We investigate the feasibility of using instruction compression at some level in a multi-level memor...
Programs exhibit significant performance variance in their access to microarchitectural structures. ...
This dissertation proves the feasibility of accurate runtime prediction of processor performance...
The resurgence of machine learning since the late 1990s has been enabled by significant advances in ...
With the increasing sophistication of both software and hardware systems, methodologies to analyse a...
Using Machine Learning to yield Scalable Program Analyses. Program Analysis tackles the problem of p...
The complexity of modern computer systems makes performance modeling an invaluable resource for guid...