In this paper we present a new technique for automatically measuring the performance of tasks, functions or arbitrary parts of a program on a multiprocessor embedded system. The technique instruments the tasks described by OpenMP, used to represent the task parallelism, while ad hoc pragmas in the source indicate other pieces of code to profile. The annotations and the instrumentation are completely target-independent, so the same code can be measured on different target architectures, on simulators or on prototypes. We validate the approach on a single and on a dual LEON 3 platform synthesized on FPGA, demonstrating a low instrumentation overhead. We show how the information obtained with this technique can be easily exploited in a hardwar...
Programmers struggle to understand performance of task-based OpenMP programs since profiling tools o...
OpenMP, a directive-based API supports multithreading programming on shared memory systems. Since O...
The performance of a computer system is important. One way of improving performance is to use multip...
In this paper we present a new technique for automatically measuring the performance of tasks, funct...
Correctly estimating the speed-up of a parallel embedded application is crucial to efficiently compa...
Parallelism has become more and more commonplace with the advent of the multicore processors. Altho...
Advances in processors architecture, such as multicore, increase the size of complexity of parallel ...
Miniaturization of electronic components has led to the introduction of complex electronic systems w...
The demand for large compute capabilities in scientific computing led to wide use and acceptance of ...
OpenMP is a popular application programming interface (API) used to write shared-memory parallel pro...
New approaches are necessary to generate performance models in current systems due the het erogeneit...
Flasskamp M, Sievers G, Ax J, et al. Performance Estimation of Streaming Applications for Hierarchic...
We have developed an environment, based upon robust, existing, open source software, for tuning appl...
AbstractOpenMP is a successful approach to writing threaded parallel applications. This article desc...
The KOJAK toolkit has been augmented with refined hardware performance counter support, including mo...
Programmers struggle to understand performance of task-based OpenMP programs since profiling tools o...
OpenMP, a directive-based API supports multithreading programming on shared memory systems. Since O...
The performance of a computer system is important. One way of improving performance is to use multip...
In this paper we present a new technique for automatically measuring the performance of tasks, funct...
Correctly estimating the speed-up of a parallel embedded application is crucial to efficiently compa...
Parallelism has become more and more commonplace with the advent of the multicore processors. Altho...
Advances in processors architecture, such as multicore, increase the size of complexity of parallel ...
Miniaturization of electronic components has led to the introduction of complex electronic systems w...
The demand for large compute capabilities in scientific computing led to wide use and acceptance of ...
OpenMP is a popular application programming interface (API) used to write shared-memory parallel pro...
New approaches are necessary to generate performance models in current systems due the het erogeneit...
Flasskamp M, Sievers G, Ax J, et al. Performance Estimation of Streaming Applications for Hierarchic...
We have developed an environment, based upon robust, existing, open source software, for tuning appl...
AbstractOpenMP is a successful approach to writing threaded parallel applications. This article desc...
The KOJAK toolkit has been augmented with refined hardware performance counter support, including mo...
Programmers struggle to understand performance of task-based OpenMP programs since profiling tools o...
OpenMP, a directive-based API supports multithreading programming on shared memory systems. Since O...
The performance of a computer system is important. One way of improving performance is to use multip...