Machine learning algorithms are widely used today for analytical tasks such as data cleaning, data categorization, or data filtering. At the same time, the rise of social media motivates recent uptake in large scale graph processing. Both categories of algorithms are dominated by iterative subtasks, i.e., processing steps which are executed repetitively until a convergence condition is met. Optimizing cluster resource allocations among multiple workloads of iterative algorithms motivates the need for estimating their resource requirements and runtime, which in turn requires: i) predicting the number of iterations, and ii) predicting the processing time of each iteration. As both parameters depend on the characteristics of the dataset and on...
Training classifiers on large databases is computationally demand-ing. It is desirable to develop ef...
Several activities of Web-based architectures are managed by algorithms that take runtime decisions ...
Given the rapid rise in energy demand by data centers and computing systems in general, it is fundam...
The ability to handle and analyse massive amounts of data has been progressively improved during the...
The explosion of Social Network Analysis (SNA) in many different areas and the growing need for pow...
The ability to accurately estimate the execution time of computationally expensive e-science algorit...
With the increasing amount of data available to scientists in disciplines as diverse as bioinformati...
Perhaps surprisingly, it is possible to predict how long an algorithm will take to run on a previous...
Big Data processing systems (e.g., Spark) have a number of resource configuration parameters, such a...
Thesis (Ph.D.)--University of Washington, 2018Large-scale data analytics is key to modern science, t...
In cloud systems, computation time can be rented by the hour and for a given number of processors. T...
Improving the efficiency of big cloud providers has become a very difficult task. The great quantity...
One of the fundamental machine learning tasks is that of predictive classification. Given that organ...
Estimates of task runtime, disk space usage, and memory consumption, are commonly used by scheduling...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Training classifiers on large databases is computationally demand-ing. It is desirable to develop ef...
Several activities of Web-based architectures are managed by algorithms that take runtime decisions ...
Given the rapid rise in energy demand by data centers and computing systems in general, it is fundam...
The ability to handle and analyse massive amounts of data has been progressively improved during the...
The explosion of Social Network Analysis (SNA) in many different areas and the growing need for pow...
The ability to accurately estimate the execution time of computationally expensive e-science algorit...
With the increasing amount of data available to scientists in disciplines as diverse as bioinformati...
Perhaps surprisingly, it is possible to predict how long an algorithm will take to run on a previous...
Big Data processing systems (e.g., Spark) have a number of resource configuration parameters, such a...
Thesis (Ph.D.)--University of Washington, 2018Large-scale data analytics is key to modern science, t...
In cloud systems, computation time can be rented by the hour and for a given number of processors. T...
Improving the efficiency of big cloud providers has become a very difficult task. The great quantity...
One of the fundamental machine learning tasks is that of predictive classification. Given that organ...
Estimates of task runtime, disk space usage, and memory consumption, are commonly used by scheduling...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Training classifiers on large databases is computationally demand-ing. It is desirable to develop ef...
Several activities of Web-based architectures are managed by algorithms that take runtime decisions ...
Given the rapid rise in energy demand by data centers and computing systems in general, it is fundam...