Optimizing parallel programs using composable locality models

Luo, Hao
Ding, Chen (1970 - )

Publication date

June 2017

Publisher

University of Rochester

Abstract

Thesis (Ph. D.)--University of Rochester. Department of Computer Science, 2017.

On modern processors, the on-chip cache memory is structured as a hierarchy to accommodate the rapidly growing disparity between peak processor speed and off-chip memory speed. This design makes a program’s performance highly correlated with its memory access pattern and with where the accessed data are positioned within the hierarchy. Locality analysis studies this correlation and optimizes programs accordingly. However, existing research in locality analysis is rather limited when dealing with contemporary parallel workloads. The performance of these workloads can be significantly influenced by how their threads interactively access da...
