Application performance is determined by a combination of many choices: hardware platform, runtime environment, languages and compilers used, algorithm choice and implementation, and more. In this complicated environment, we find that the use of mini-applications - small self-contained proxies for real applications - is an excellent approach for rapidly exploring the parameter space of all these choices. Furthermore, use of mini-applications enriches the interaction between application, library and computer system developers by providing explicit functioning software and concrete performance results that lead to detailed, focused discussions of design trade-offs, algorithm choices and runtime performance issues. In this paper we discuss a c...
International audienceThe advent of multicore and manycore processors, including GPUs, in the custom...
CPU clock frequency is not likely to be increased significantly in the coming years, and data analys...
AbstractThe advent of multicore and manycore processors, including GPUs, in the customer market enco...
Computational science and engineering application programs are typically large, complex, and dynamic...
Miniapps serve as test beds for prototyping and evaluating new al-gorithms, data structures, and pro...
There are several reasons why the performance of current distributed and heterogeneous systems is of...
In high-performance computing, performance analysis, tuning, and exploration are relevant throughout...
Abstract. State-of-the-art run-time systems are a poor match to diverse, dynamic distributed applica...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
The size and complexity of high-performance computing applications present a serious challenge to ma...
Computers perform different applications in different ways. To characterize an application performan...
State-of-the-art run-time systems are a poor match to diverse, dynamic distributed applications beca...
In this work, several mini-apps have been created to enhance a real-world application performance, n...
This work describes ongoing work for measuring the performance of an application running on a machin...
Performance is the critical feature in the design and productivity of software systems. A key to imp...
International audienceThe advent of multicore and manycore processors, including GPUs, in the custom...
CPU clock frequency is not likely to be increased significantly in the coming years, and data analys...
AbstractThe advent of multicore and manycore processors, including GPUs, in the customer market enco...
Computational science and engineering application programs are typically large, complex, and dynamic...
Miniapps serve as test beds for prototyping and evaluating new al-gorithms, data structures, and pro...
There are several reasons why the performance of current distributed and heterogeneous systems is of...
In high-performance computing, performance analysis, tuning, and exploration are relevant throughout...
Abstract. State-of-the-art run-time systems are a poor match to diverse, dynamic distributed applica...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
The size and complexity of high-performance computing applications present a serious challenge to ma...
Computers perform different applications in different ways. To characterize an application performan...
State-of-the-art run-time systems are a poor match to diverse, dynamic distributed applications beca...
In this work, several mini-apps have been created to enhance a real-world application performance, n...
This work describes ongoing work for measuring the performance of an application running on a machin...
Performance is the critical feature in the design and productivity of software systems. A key to imp...
International audienceThe advent of multicore and manycore processors, including GPUs, in the custom...
CPU clock frequency is not likely to be increased significantly in the coming years, and data analys...
AbstractThe advent of multicore and manycore processors, including GPUs, in the customer market enco...