Correctly estimating the speed-up of a parallel embedded application is crucial to efficiently compare different parallelization techniques, task graph transformations or mapping and scheduling solutions. Unfortunately, especially in case of control-dominated applications, task correlations may heavily affect the execution time of the solutions and usually this is not properly taken into account during performance analysis. We propose a methodology that combines a single profiling of the initial sequential specification with different decisions in terms of partitioning, mapping, and scheduling in order to better estimate the actual speed-up of these solutions. We validated our approach on a multi-processor simulation platform: experimental ...
Although parallel processing is a promising way of increasing the performance cost efficiently, it i...
In signal processing area, applications involve a large amount of computation, suggesting the use of...
We analyse the inherent performance of parallel software. For this end we use a task graph to model ...
Correctly estimating the speed-up of a parallel embedded application is crucial to efficiently compa...
The speed-up estimation of parallelized code is crucial to efficiently compare different paralleliza...
In this paper we present a new technique for automatically measuring the performance of tasks, funct...
The effective use of computational resources requires a good understanding of parallel architectures...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
The development of a new graph theoretic model for describing the relation between a decomposed algo...
Flasskamp M, Sievers G, Ax J, et al. Performance Estimation of Streaming Applications for Hierarchic...
Sequential graph algorithms are implemented through ordered execution of tasks to achieve high work ...
Computer performance modeling of possibly complex computations running on highly concurrent systems ...
The performance of a computer system is important. One way of improving performance is to use multip...
A considerably fraction of science discovery is nowadays relying on computer simulations. High Per...
Although parallel processing is a promising way of increasing the performance cost efficiently, it i...
In signal processing area, applications involve a large amount of computation, suggesting the use of...
We analyse the inherent performance of parallel software. For this end we use a task graph to model ...
Correctly estimating the speed-up of a parallel embedded application is crucial to efficiently compa...
The speed-up estimation of parallelized code is crucial to efficiently compare different paralleliza...
In this paper we present a new technique for automatically measuring the performance of tasks, funct...
The effective use of computational resources requires a good understanding of parallel architectures...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
The development of a new graph theoretic model for describing the relation between a decomposed algo...
Flasskamp M, Sievers G, Ax J, et al. Performance Estimation of Streaming Applications for Hierarchic...
Sequential graph algorithms are implemented through ordered execution of tasks to achieve high work ...
Computer performance modeling of possibly complex computations running on highly concurrent systems ...
The performance of a computer system is important. One way of improving performance is to use multip...
A considerably fraction of science discovery is nowadays relying on computer simulations. High Per...
Although parallel processing is a promising way of increasing the performance cost efficiently, it i...
In signal processing area, applications involve a large amount of computation, suggesting the use of...
We analyse the inherent performance of parallel software. For this end we use a task graph to model ...