Efficient use of hardware resources is a vital part of getting good results within high performance computing. This thesis explores the predictability of optimal CPU-core distribution between two tasks running in parallel on a shared-memory machine, with the intent to reach the shortest total runtime possible. The predictions are based on the weight and speedup of each task, in regards to the CPU-frequency decrease that comes with a growing number of active cores in modern CPUs. The weight of a task is the number of floating point operations needed to compute it to completion. The Intel oneAPI Math Kernel Library is used to create a set of different tasks, where each task consists of a single call to a dgemm-routine. Two prediction algorith...
Nowadays multicores machines are becoming more and more common. Ideally, all the applications benefi...
We present a model of multithreaded computation with an emphasis on estimat-ing parallelism overhead...
The efficiency of a multi-core architecture is directly related to the mechanisms that map the thre...
Efficient use of hardware resources is a vital part of getting good results within high performance ...
Conference of 9th IEEE International Symposium on Embedded Multicore/Manycore SoCs, MCSoC 2015 ; Con...
The efficiency of a multi-core architecture is directly related to the mechanisms that map the threa...
Prediction of the performance of parallel applications is a concept useful in several domains of sof...
Today's computers have processors with multiple cores that allow several applications to execute sim...
The efficient mapping of program parallelism to multi-core processors is highly dependent on the und...
This work is a part of the global tendency to use modern computing systems for modeling the phase-fi...
To make the best use of the resources in a shared grid environment, an application scheduler must ma...
In this dissertation we present a methodology for predicting the best priority pair for a given co-s...
As computers with tens of thousands of processors successfully deliver high performance power for so...
AbstractThe current trends in processor industry opens the way to next generations of microprocessor...
Predicting the execution time of parallel programs involves computing the maximum or minimum of the ...
Nowadays multicores machines are becoming more and more common. Ideally, all the applications benefi...
We present a model of multithreaded computation with an emphasis on estimat-ing parallelism overhead...
The efficiency of a multi-core architecture is directly related to the mechanisms that map the thre...
Efficient use of hardware resources is a vital part of getting good results within high performance ...
Conference of 9th IEEE International Symposium on Embedded Multicore/Manycore SoCs, MCSoC 2015 ; Con...
The efficiency of a multi-core architecture is directly related to the mechanisms that map the threa...
Prediction of the performance of parallel applications is a concept useful in several domains of sof...
Today's computers have processors with multiple cores that allow several applications to execute sim...
The efficient mapping of program parallelism to multi-core processors is highly dependent on the und...
This work is a part of the global tendency to use modern computing systems for modeling the phase-fi...
To make the best use of the resources in a shared grid environment, an application scheduler must ma...
In this dissertation we present a methodology for predicting the best priority pair for a given co-s...
As computers with tens of thousands of processors successfully deliver high performance power for so...
AbstractThe current trends in processor industry opens the way to next generations of microprocessor...
Predicting the execution time of parallel programs involves computing the maximum or minimum of the ...
Nowadays multicores machines are becoming more and more common. Ideally, all the applications benefi...
We present a model of multithreaded computation with an emphasis on estimat-ing parallelism overhead...
The efficiency of a multi-core architecture is directly related to the mechanisms that map the thre...