With the explosion of data and the increasing complexity of data analysis, large-scale data analysis imposes significant challenges in systems design. While current research focuses on scaling out to large clusters, these scale-out solutions introduce a significant amount of overhead. This thesis is motivated by the advance of new I/O technologies such as flash memory. Instead of scaling out, we explore efficient system designs in a single commodity machine with non-uniform memory architecture (NUMA) and scale to large datasets by utilizing commodity solid-state drives (SSDs). This thesis explores the impact of the new I/O technologies on large-scale data analysis. Instead of implementing individual data analysis algorithms for SSDs, we dev...
Scientific workflows are often composed of compute-intensive simulations and data-intensive analysis...
Flash memory promises to revolutionize storage systems because of its massive performance gains, rug...
<p>Statistical analysis of massive array data is becoming indispensable in answering important scien...
As capacity for data collection and storage continues to grow, data analytics requirements regularl...
Fast content-based searches and complex analytics of the vast amount of data collected via social me...
Flash-based key-value systems are widely deployed in today’s data centers for providing high-speed d...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
Abstract—Graph analysis performs many random reads and writes, thus these workloads are typically pe...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The longstanding goals of storage system design have been to provide simple abstractions for applica...
The research that stems from my doctoral dissertation focuses on addressing essential challenges in ...
Computing in the last decade has been characterized by the rise of data- intensive scalable computin...
As their prices decline, their storage capacities increase, and their endurance improves, NAND Flash...
the date of receipt and acceptance should be inserted later Abstract Recent graph computation approa...
University of Minnesota Ph.D. dissertation. August 2015. Major: Computer Science. Advisor: David Du...
Scientific workflows are often composed of compute-intensive simulations and data-intensive analysis...
Flash memory promises to revolutionize storage systems because of its massive performance gains, rug...
<p>Statistical analysis of massive array data is becoming indispensable in answering important scien...
As capacity for data collection and storage continues to grow, data analytics requirements regularl...
Fast content-based searches and complex analytics of the vast amount of data collected via social me...
Flash-based key-value systems are widely deployed in today’s data centers for providing high-speed d...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
Abstract—Graph analysis performs many random reads and writes, thus these workloads are typically pe...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The longstanding goals of storage system design have been to provide simple abstractions for applica...
The research that stems from my doctoral dissertation focuses on addressing essential challenges in ...
Computing in the last decade has been characterized by the rise of data- intensive scalable computin...
As their prices decline, their storage capacities increase, and their endurance improves, NAND Flash...
the date of receipt and acceptance should be inserted later Abstract Recent graph computation approa...
University of Minnesota Ph.D. dissertation. August 2015. Major: Computer Science. Advisor: David Du...
Scientific workflows are often composed of compute-intensive simulations and data-intensive analysis...
Flash memory promises to revolutionize storage systems because of its massive performance gains, rug...
<p>Statistical analysis of massive array data is becoming indispensable in answering important scien...