This thesis explores methods to find features that affect the start time of mobile apps. To help app developers improve performance over patch cycles, we implement an alarm pipeline in Apache Spark and tested it. This implementation is able to detect notable changes in the start time distribution and alert the developer. Spark is a scalable cluster computing framework, that proved well suited for the given problem. The program consists of five steps; preprocessing the data, fitting a Gaussian mixture model (GMM) to the data, and finding differences in the distributions. Then a linear regression model is fit to map the available features to the GMM parametrization. The linear regression enables the developer to find relationships between the...
Predicting the performance of cloud services is intrinsically hard. In this work, we pursue an appro...
Unlike traditional computing platforms (such as desktops and servers), mobile platforms provide a v...
To analyze large-scale data efficiently, developers have created various big data processing framewo...
Apache Spark is being increasingly used to execute big data applications on cluster computing platfo...
The pace and volume of code churn necessary to evolve modern software systems present challenges for...
Spark is an in-memory framework for implementing distributed applications of various types. Predicti...
The complexity of resource usage and power consumption on cloud-based applications makes the underst...
Cloud computing solutions provide applications, storage capabilities, and computational resources th...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Interactive mobile applications attract lots of attentions recently. They utilize complex algorithms...
Abstract—Due to the proliferation of mobile applications (abbreviated as Apps) on smart phones, user...
Due to the latest development in the context of Internet of Things, the amount of generated and coll...
In many systems log information concerning event timing is available. Information about the performa...
Big data Hadoop and Spark applications are deployed on infrastructure managed by resource managers s...
We contribute by quantifying the effect of network latency and battery consumption on mobile app per...
Predicting the performance of cloud services is intrinsically hard. In this work, we pursue an appro...
Unlike traditional computing platforms (such as desktops and servers), mobile platforms provide a v...
To analyze large-scale data efficiently, developers have created various big data processing framewo...
Apache Spark is being increasingly used to execute big data applications on cluster computing platfo...
The pace and volume of code churn necessary to evolve modern software systems present challenges for...
Spark is an in-memory framework for implementing distributed applications of various types. Predicti...
The complexity of resource usage and power consumption on cloud-based applications makes the underst...
Cloud computing solutions provide applications, storage capabilities, and computational resources th...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Interactive mobile applications attract lots of attentions recently. They utilize complex algorithms...
Abstract—Due to the proliferation of mobile applications (abbreviated as Apps) on smart phones, user...
Due to the latest development in the context of Internet of Things, the amount of generated and coll...
In many systems log information concerning event timing is available. Information about the performa...
Big data Hadoop and Spark applications are deployed on infrastructure managed by resource managers s...
We contribute by quantifying the effect of network latency and battery consumption on mobile app per...
Predicting the performance of cloud services is intrinsically hard. In this work, we pursue an appro...
Unlike traditional computing platforms (such as desktops and servers), mobile platforms provide a v...
To analyze large-scale data efficiently, developers have created various big data processing framewo...