With the increasing adoption of distributed systems in both academia and industry, and with the increasing computational and storage requirements of distributed applications, users inevitably demand more from these systems. Moreover, users also depend on these systems for latency and throughput sensitive applications, such as interactive perception applications and MapReduce applications, which make the performance of these systems even more important. Therefore, for the users it is very important that distributed systems provide consistent performance, that is, the system provides a similar level of performance at all times. In this thesis we address the problem of understanding and improving the performance consistency of state-of-the-art...
Big Data such as Terabyte and Petabyte datasets are rapidly becoming the new norm for various organi...
Although most current cloud providers, such as Amazon Web Services (AWS) and Microsoft Azure offer d...
Grids consist of both dedicated and non-dedicated clusters. For effective mapping of parallel applic...
With the increasing adoption of distributed systems in both academia and industry, and with the incr...
Loosely coupled applications composed of a potentially very large number (from tens of thousands to ...
Today’s challenging problems in science and industry are solved by complex data-driven al...
Ever more scientists are employing large-scale distributed systems such as grids for their computati...
Due to the growing size of compute clusters, large scale parallel applications increasingly have to ...
Highly variable parallel application execution time is a persistent issue in cluster computing envir...
Loosely coupled applications composed of a potentially very large number (from tens of thousands to ...
This paper presents a comprehensive statistical analysis of a variety of workloads collected on prod...
For decades distributed computing has been mainly an academic subject. Today, it has become mainstre...
Many organizations routinely analyze large datasets using systems for distributed data-parallel proc...
In Grid applications the heterogeneity and potential failures of the computing infrastructure poses ...
The paper discusses our practical experience and theoretical results of investigating the impact of ...
Big Data such as Terabyte and Petabyte datasets are rapidly becoming the new norm for various organi...
Although most current cloud providers, such as Amazon Web Services (AWS) and Microsoft Azure offer d...
Grids consist of both dedicated and non-dedicated clusters. For effective mapping of parallel applic...
With the increasing adoption of distributed systems in both academia and industry, and with the incr...
Loosely coupled applications composed of a potentially very large number (from tens of thousands to ...
Today’s challenging problems in science and industry are solved by complex data-driven al...
Ever more scientists are employing large-scale distributed systems such as grids for their computati...
Due to the growing size of compute clusters, large scale parallel applications increasingly have to ...
Highly variable parallel application execution time is a persistent issue in cluster computing envir...
Loosely coupled applications composed of a potentially very large number (from tens of thousands to ...
This paper presents a comprehensive statistical analysis of a variety of workloads collected on prod...
For decades distributed computing has been mainly an academic subject. Today, it has become mainstre...
Many organizations routinely analyze large datasets using systems for distributed data-parallel proc...
In Grid applications the heterogeneity and potential failures of the computing infrastructure poses ...
The paper discusses our practical experience and theoretical results of investigating the impact of ...
Big Data such as Terabyte and Petabyte datasets are rapidly becoming the new norm for various organi...
Although most current cloud providers, such as Amazon Web Services (AWS) and Microsoft Azure offer d...
Grids consist of both dedicated and non-dedicated clusters. For effective mapping of parallel applic...