Prediction of the performance of parallel applications is a concept useful in several domains of software operation. In the commercial world, it’s often useful to be able to anticipate how an application will perform on a customer’s machine with a minimal burden to the user. In the same spirit, it’s in the best interest of a user/consumer of computational software to most optimally operate it. In the super-computing/distributed computing world, being able to anticipate the performance of an application on a set of compute-nodes allows one to more optimally select the set of nodes to execute on. In terms of a large-scale shared computing environment where parallel computational jobs are assigned resources and scheduled for execution, being a...
Standard benchmarking provides the run times for given programs on given machines, but fails to prov...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
International audienceEstimating the potential performance of parallel applicationson the yet-to-be-...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
We address the problem of performance prediction for parallel programs executed on clusters of heter...
We propose a model for describing the parallel performance of multigrid software on distributed mem...
In this paper, we investigate the traffic characteristics of parallel and high performance computi...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
Context. Today’s parallel systems are widely used in different computational tasks. Developing paral...
The CPUs, memory, interconnection network, operating system, runtime system, I/O subsystem, and appl...
High-performance computing is essential for solving large problems and for reducing the time to solu...
The performance of a computer system is important. One way of improving performance is to use multip...
In order to measure the performance of a parallel machine, a set of application kernels as benchmark...
International audienceIncreasingly complex consumer electronics applications call for embedded proce...
We propose a model for describing and predicting the performance of practical parallel engineering ...
Standard benchmarking provides the run times for given programs on given machines, but fails to prov...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
International audienceEstimating the potential performance of parallel applicationson the yet-to-be-...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
We address the problem of performance prediction for parallel programs executed on clusters of heter...
We propose a model for describing the parallel performance of multigrid software on distributed mem...
In this paper, we investigate the traffic characteristics of parallel and high performance computi...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
Context. Today’s parallel systems are widely used in different computational tasks. Developing paral...
The CPUs, memory, interconnection network, operating system, runtime system, I/O subsystem, and appl...
High-performance computing is essential for solving large problems and for reducing the time to solu...
The performance of a computer system is important. One way of improving performance is to use multip...
In order to measure the performance of a parallel machine, a set of application kernels as benchmark...
International audienceIncreasingly complex consumer electronics applications call for embedded proce...
We propose a model for describing and predicting the performance of practical parallel engineering ...
Standard benchmarking provides the run times for given programs on given machines, but fails to prov...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
International audienceEstimating the potential performance of parallel applicationson the yet-to-be-...