Many techniques such as scheduling and resource provisioning rely on performance prediction of workflow tasks for varying input data. However, such estimates are difficult to generate in the cloud. This paper introduces a novel two-stage machine learning approach for predicting workflow task execution times for varying input data in the cloud. In order to achieve high accuracy predictions, our approach relies on parameters reflecting runtime information and two stages of predictions. Empirical results for four real world workflow applications and several commercial cloud providers demonstrate that our approach outperforms existing prediction methods. In our experiments, our approach respectively achieves a best-case and worst-case estimatio...
Spark is an in-memory framework for implementing distributed applications of various types. Predicti...
Estimates of task runtime, disk space usage, and memory consumption, are commonly used by scheduling...
Spark is an in-memory framework for implementing distributed applications of various types. Predicti...
Infrastructure as a service clouds hide the complexity of maintaining the physical infrastructure wi...
Abstract—Scientific workflows, which capture large compu-tational problems, may be executed on large...
The ability to accurately estimate the execution time of computationally expensive e-science algorit...
The ability to accurately estimate the execution time of computationally expensive e-science algorit...
The ability to accurately estimate the execution time of computationally expensive e-science algorit...
The ability to accurately estimate the execution time of computationally expensive e-science algorit...
2019 IEEE. Scientific workflows are complex, resource intensive, dynamic in nature and require elast...
Cloud computing is gaining enormous popularity every day. But with the growing demand of cloud comp...
Cloud computing is gaining enormous popularity every day. But with the growing demand of cloud comp...
Cloud computing provides various types of computing utilities where clients pay for services dependi...
Cloud computing provides various types of computing utilities where clients pay for services dependi...
Background: Service oriented architectures are becoming increasingly popular due to their flexibilit...
Spark is an in-memory framework for implementing distributed applications of various types. Predicti...
Estimates of task runtime, disk space usage, and memory consumption, are commonly used by scheduling...
Spark is an in-memory framework for implementing distributed applications of various types. Predicti...
Infrastructure as a service clouds hide the complexity of maintaining the physical infrastructure wi...
Abstract—Scientific workflows, which capture large compu-tational problems, may be executed on large...
The ability to accurately estimate the execution time of computationally expensive e-science algorit...
The ability to accurately estimate the execution time of computationally expensive e-science algorit...
The ability to accurately estimate the execution time of computationally expensive e-science algorit...
The ability to accurately estimate the execution time of computationally expensive e-science algorit...
2019 IEEE. Scientific workflows are complex, resource intensive, dynamic in nature and require elast...
Cloud computing is gaining enormous popularity every day. But with the growing demand of cloud comp...
Cloud computing is gaining enormous popularity every day. But with the growing demand of cloud comp...
Cloud computing provides various types of computing utilities where clients pay for services dependi...
Cloud computing provides various types of computing utilities where clients pay for services dependi...
Background: Service oriented architectures are becoming increasingly popular due to their flexibilit...
Spark is an in-memory framework for implementing distributed applications of various types. Predicti...
Estimates of task runtime, disk space usage, and memory consumption, are commonly used by scheduling...
Spark is an in-memory framework for implementing distributed applications of various types. Predicti...