A proliferation of frameworks have emerged to handle the challenges of making distributed computations reliable and scalable. These enable users to easily perform analysis of large datasets on commodity clusters. As users have demanded better response times for these computations, newer versions of these frameworks have focused on efficiently keeping computation in memory. A major challenge in deploying such frameworks is understanding application memory requirements. Just as the layers of abstraction of frameworks assist in writing efficient and robust applications, they hide the true memory requirements.In this dissertation, I describe and evaluate SLAMR, a tool I have developed for providing users with memory recommendations for progr...
This paper presents ESTIMA, an easy-to-use tool for extrapolating the scalability of in-memory appli...
Motivated by the increasing popularity of hosting in-memory big-data analytics in cloud, we present ...
Memory bloat is loosely defined as an excessive memory usage by an application during its execution....
The sheer increase in the volume of data over the last decade has triggered research in cluster comp...
There has been much research devoted to improving the performance of data analytics frameworks, but ...
In recent years the High Performance Computing (HPC) industry has benefited from the development of ...
Big data analytics frameworks, such as Spark and Giraph, need to process and cache massive amounts o...
In last decade, data analytics have rapidly progressed from traditional disk-based processing to mod...
Hadoop provides a scalable solution on traditional cluster-based Big Data platforms but imposes per...
Sheer increase in volume of data over the last decade has triggered research in cluster computing fr...
While cluster computing frameworks are contin-uously evolving to provide real-time data analysis cap...
Though the performance of many applications is dominated by memory behavior, our ability to describe...
As computing efficiency becomes constrained by hardware scaling limitations, code optimization grows...
(Under the direction of Assistant Professor Dr. Frank Mueller). Over recent decades, computing speed...
Planning optimized memory management is critical for Big Data analysis tools to perform faster runti...
This paper presents ESTIMA, an easy-to-use tool for extrapolating the scalability of in-memory appli...
Motivated by the increasing popularity of hosting in-memory big-data analytics in cloud, we present ...
Memory bloat is loosely defined as an excessive memory usage by an application during its execution....
The sheer increase in the volume of data over the last decade has triggered research in cluster comp...
There has been much research devoted to improving the performance of data analytics frameworks, but ...
In recent years the High Performance Computing (HPC) industry has benefited from the development of ...
Big data analytics frameworks, such as Spark and Giraph, need to process and cache massive amounts o...
In last decade, data analytics have rapidly progressed from traditional disk-based processing to mod...
Hadoop provides a scalable solution on traditional cluster-based Big Data platforms but imposes per...
Sheer increase in volume of data over the last decade has triggered research in cluster computing fr...
While cluster computing frameworks are contin-uously evolving to provide real-time data analysis cap...
Though the performance of many applications is dominated by memory behavior, our ability to describe...
As computing efficiency becomes constrained by hardware scaling limitations, code optimization grows...
(Under the direction of Assistant Professor Dr. Frank Mueller). Over recent decades, computing speed...
Planning optimized memory management is critical for Big Data analysis tools to perform faster runti...
This paper presents ESTIMA, an easy-to-use tool for extrapolating the scalability of in-memory appli...
Motivated by the increasing popularity of hosting in-memory big-data analytics in cloud, we present ...
Memory bloat is loosely defined as an excessive memory usage by an application during its execution....