Recent scheduling heuristics for task-based applications have managed to improve their by taking into account memory-related properties such as data locality and cache sharing. However, there is still a general lack of tools that can provide insights into why, and where, different schedulers improve memory behavior, and how this is related to the applications' performance. To address this, we present TaskInsight, a technique to characterize the memory behavior of different task schedulers through the analysis of data reuse between tasks. TaskInsight provides high-level, quantitative information that can be correlated with tasks' performance variation over time to understand data reuse through the caches due to scheduling choices. TaskInsigh...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Task-based programming models are becoming increasingly important, as they can reduce the synchroniz...
Caches help reduce the average execution time of tasks due to their fast operational speeds. However...
Recent scheduling heuristics for task-based applications have managed to improve their by taking int...
Maximizing the performance of computer systems while making them more energy efficient is vital for ...
Making computer systems more energy efficient while obtaining the maximum performance possible is ke...
Proceedings of the First PhD Symposium on Sustainable Ultrascale Computing Systems (NESUS PhD 2016)...
The shift toward multicore processors has transformed the software and hardware landscape in the las...
Abstract—This paper proposes a methodology to study the data reuse quality of task-parallel runtimes...
Abstract. This paper proposes a methodology to study the data reuse quality of task-parallel runtime...
Scientific and technological advances in the area of integrated circuits have allowed the performanc...
Modern computer architectures expose an increasing number of parallel features supported by complex ...
Abstract. We develop a new metric for job scheduling that in-cludes the effects of memory contention...
In multicore systems, shared resources such as caches or the memory subsystem can lead to contention...
Architects have adopted the shared memory model that implicitly manages cache coherence and cache ca...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Task-based programming models are becoming increasingly important, as they can reduce the synchroniz...
Caches help reduce the average execution time of tasks due to their fast operational speeds. However...
Recent scheduling heuristics for task-based applications have managed to improve their by taking int...
Maximizing the performance of computer systems while making them more energy efficient is vital for ...
Making computer systems more energy efficient while obtaining the maximum performance possible is ke...
Proceedings of the First PhD Symposium on Sustainable Ultrascale Computing Systems (NESUS PhD 2016)...
The shift toward multicore processors has transformed the software and hardware landscape in the las...
Abstract—This paper proposes a methodology to study the data reuse quality of task-parallel runtimes...
Abstract. This paper proposes a methodology to study the data reuse quality of task-parallel runtime...
Scientific and technological advances in the area of integrated circuits have allowed the performanc...
Modern computer architectures expose an increasing number of parallel features supported by complex ...
Abstract. We develop a new metric for job scheduling that in-cludes the effects of memory contention...
In multicore systems, shared resources such as caches or the memory subsystem can lead to contention...
Architects have adopted the shared memory model that implicitly manages cache coherence and cache ca...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Task-based programming models are becoming increasingly important, as they can reduce the synchroniz...
Caches help reduce the average execution time of tasks due to their fast operational speeds. However...