HPC systems expose configuration options that help users optimize their applications' execution. Questions related to the best thread and data mapping, number of threads, or cache prefetching have been posed for different applications, yet they have been mostly limited to a single optimization objective (e.g., performance) and a fixed application problem size. Unfortunately, optimization strategies that work well in one scenario may generalize poorly when applied in new contexts. In this work, we investigate the impact of configuration options and different problem sizes over both performance and energy. Through a search space exploration, we have found that well-adapted NUMA-related options and cache prefetchers provide significantly more ...
Achieving high application performance depends on the combination of memory footprint, instruction m...
I/O is one of the main performance bottlenecks for many data-intensive scientific applications. Accu...
"What Mathematics is to Physics, Data traversal is to High-performance computing." The world of Comp...
HPC systems expose configuration options that help users optimize their applications' execution. Que...
Both NUMA thread/data placement and hardware prefetcher configuration have significant impacts on HP...
Both NUMA thread/data placement and hardware prefetcher configuration have significant impacts on HP...
With respect to the continuous growth of computing systems, the energy-efficiency requirement of the...
Energy and power are the main design constraints for modern high-performance computing systems. Inde...
Selecting appropriate computational resources for data processing jobs on large clusters is difficul...
Performance and energy efficiency are now critical concerns in high performance scientific computing...
Heterogeneous computing systems provide high performance and energy efficiency. However, to optimall...
High-performance computing (HPC) systems consist of thousands of compute nodes, storage systems and ...
As multi-core processor architectures with tens or even hundreds of cores, not all of them necessari...
Iterative compiler optimization has been shown to outperform static approaches. This, however, is at...
Machine Learning involves analysing large sets of training data to make predictions and decisions to...
Achieving high application performance depends on the combination of memory footprint, instruction m...
I/O is one of the main performance bottlenecks for many data-intensive scientific applications. Accu...
"What Mathematics is to Physics, Data traversal is to High-performance computing." The world of Comp...
HPC systems expose configuration options that help users optimize their applications' execution. Que...
Both NUMA thread/data placement and hardware prefetcher configuration have significant impacts on HP...
Both NUMA thread/data placement and hardware prefetcher configuration have significant impacts on HP...
With respect to the continuous growth of computing systems, the energy-efficiency requirement of the...
Energy and power are the main design constraints for modern high-performance computing systems. Inde...
Selecting appropriate computational resources for data processing jobs on large clusters is difficul...
Performance and energy efficiency are now critical concerns in high performance scientific computing...
Heterogeneous computing systems provide high performance and energy efficiency. However, to optimall...
High-performance computing (HPC) systems consist of thousands of compute nodes, storage systems and ...
As multi-core processor architectures with tens or even hundreds of cores, not all of them necessari...
Iterative compiler optimization has been shown to outperform static approaches. This, however, is at...
Machine Learning involves analysing large sets of training data to make predictions and decisions to...
Achieving high application performance depends on the combination of memory footprint, instruction m...
I/O is one of the main performance bottlenecks for many data-intensive scientific applications. Accu...
"What Mathematics is to Physics, Data traversal is to High-performance computing." The world of Comp...