This paper evaluates several main learning and heuris-tic techniques for application run time predictions on clus-ters and parallel supercomputers. Techniques being in-vestigated are based on mining the similarities in histor-ical workload traces. Firstly several feature selection al-gorithms are introduced and compared, such as Rough Set based “Improved Reduct ” and Genetic Algorithms. These algorithms are able to select relevant attributes that identify similar jobs in the past. Secondly two induction methods, namely Similarity Templates and Instance Based Learning (IBL), are empirically evaluated for prediction generation. Our evaluation is based on real workloads with diverse characteristics, which are collected from the LHC Comput-ing ...
Doctor of PhilosophyDepartment of Computer ScienceDaniel A. AndresenOverestimation of High Performan...
High throughput computing (HTC) has aided the scientific community in the analysis of vast amounts o...
I/O is one of the main performance bottlenecks for many data-intensive scientific applications. Accu...
The authors present a technique for deriving predictions for the run times of parallel applications ...
We present a technique for deriving predictions for the run times of parallel applications from the ...
The paper is devoted to machine learning methods and algorithms for the supercomputer jobs executio...
In large-scale Grids with many possible resources (clus-ters of computing elements) to run applicati...
Experimental performance studies on computer systems, including Grids, require deep understandings o...
The purposes of runtime prediction in grid computing are to provide quality information in order to ...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
The main goal of a Workload Management System (WMS) is to find and allocate resources for the given ...
Performance predictions for large problem sizes and processors using limited small scale runs are us...
To make effective job placement policies for a volatile large scale heterogeneous system or in grid ...
Mathematical solvers have evolved to become complex software and thereby have become a difficult sub...
A good running time prediction of tasks is very helpful and important for job scheduling and resourc...
Doctor of PhilosophyDepartment of Computer ScienceDaniel A. AndresenOverestimation of High Performan...
High throughput computing (HTC) has aided the scientific community in the analysis of vast amounts o...
I/O is one of the main performance bottlenecks for many data-intensive scientific applications. Accu...
The authors present a technique for deriving predictions for the run times of parallel applications ...
We present a technique for deriving predictions for the run times of parallel applications from the ...
The paper is devoted to machine learning methods and algorithms for the supercomputer jobs executio...
In large-scale Grids with many possible resources (clus-ters of computing elements) to run applicati...
Experimental performance studies on computer systems, including Grids, require deep understandings o...
The purposes of runtime prediction in grid computing are to provide quality information in order to ...
As High Performance Computing (HPC) has grown considerably and is expected to grow even more, effect...
The main goal of a Workload Management System (WMS) is to find and allocate resources for the given ...
Performance predictions for large problem sizes and processors using limited small scale runs are us...
To make effective job placement policies for a volatile large scale heterogeneous system or in grid ...
Mathematical solvers have evolved to become complex software and thereby have become a difficult sub...
A good running time prediction of tasks is very helpful and important for job scheduling and resourc...
Doctor of PhilosophyDepartment of Computer ScienceDaniel A. AndresenOverestimation of High Performan...
High throughput computing (HTC) has aided the scientific community in the analysis of vast amounts o...
I/O is one of the main performance bottlenecks for many data-intensive scientific applications. Accu...