Text analysis tools are nowadays required to process increasingly large corpora which are often organized as small files (abstracts, news articles, etc.). Cloud computing offers a convenient, on-demand, pay-as-you-go computing environment for solving such problems. We investigate provisioning on the Amazon EC2 cloud from the user perspective, attempting to provide a scheduling strategy that is both timely and cost effective. We derive an execution plan using an empirically determined application performance model. A first goal of our performance measurements is to determine an optimal file size for our application to consume. Using the subset-sum first fit heuristic we reshape the input data by merging files in order to match as closely as ...
Empirical thesis.Bibliography: pages 75-76.1. Introduction -- 2. Background and related work -- 3. D...
The popularity of Amazon's EC2 cloud platform has increased in commercial and scientific high-perfor...
The scale of scientific applications becomes increasingly large not only in computation, but also in...
Cloud computing provides substantial opportunities to researchers who demand pay-as-you-go computing...
Workflows are used to orchestrate data-intensive applications in many different scientific domains....
Infrastructure-as-a-Service (IaaS) platforms, such as Amazon EC2, allow clients access to massive co...
Cloud Computing provides computing and storage resources at economical price with flexibility, mobil...
Having constantly increasing amounts of data, the analysis of it is often entrusted for a MapReduce ...
Cloud provider Amazon Elastic Compute Cloud (EC2) gives access to resources in the form of virtual s...
Commercial cloud offerings, such as Amazon's EC2, let users allocate compute resources on demand, ch...
Commercial cloud offerings, such as Amazon's EC2, let users allocate compute resources on demand, ch...
If you plan to use Amazon Web Services to run applications in the cloud, the end-to-end approach in ...
This paper presents a cost optimization model for scheduling scientific workflows on IaaS clouds suc...
The exponential growth of data and application complexity has brought new challenges in the distribu...
Cloud computing is an emerging commercial infrastructure paradigm that promises to eliminate the nee...
Empirical thesis.Bibliography: pages 75-76.1. Introduction -- 2. Background and related work -- 3. D...
The popularity of Amazon's EC2 cloud platform has increased in commercial and scientific high-perfor...
The scale of scientific applications becomes increasingly large not only in computation, but also in...
Cloud computing provides substantial opportunities to researchers who demand pay-as-you-go computing...
Workflows are used to orchestrate data-intensive applications in many different scientific domains....
Infrastructure-as-a-Service (IaaS) platforms, such as Amazon EC2, allow clients access to massive co...
Cloud Computing provides computing and storage resources at economical price with flexibility, mobil...
Having constantly increasing amounts of data, the analysis of it is often entrusted for a MapReduce ...
Cloud provider Amazon Elastic Compute Cloud (EC2) gives access to resources in the form of virtual s...
Commercial cloud offerings, such as Amazon's EC2, let users allocate compute resources on demand, ch...
Commercial cloud offerings, such as Amazon's EC2, let users allocate compute resources on demand, ch...
If you plan to use Amazon Web Services to run applications in the cloud, the end-to-end approach in ...
This paper presents a cost optimization model for scheduling scientific workflows on IaaS clouds suc...
The exponential growth of data and application complexity has brought new challenges in the distribu...
Cloud computing is an emerging commercial infrastructure paradigm that promises to eliminate the nee...
Empirical thesis.Bibliography: pages 75-76.1. Introduction -- 2. Background and related work -- 3. D...
The popularity of Amazon's EC2 cloud platform has increased in commercial and scientific high-perfor...
The scale of scientific applications becomes increasingly large not only in computation, but also in...