Data-driven discovery has become critical to the mission of many enterprises and scientific research. At the same time, the rate of data production and collection is outpacing technology scaling, suggesting that significant future investment, time, and energy will be needed for data processing. Straightforwardly increasing hardware resources can address the extra processing needs by either adding more CPU cores/memory (scale-up) or more worker nodes (scale-out). However, it will introduce higher computing cost that may not be feasible when budget is limited. One powerful tool to address the above challenge is approximate computing, which trades off computational time and resources with computational accuracy by reducing the amount of data n...
Over the past decade, the growth of data has been phenomenal. The amount of data which the world acc...
Today, we collect a large amount of data, and the volume of the data we collect is projected to grow...
We study a novel solution to executing aggregation (and specifically COUNT) queries over large-scale...
Today, most modern online services make use of big data analytics systems to extract useful informat...
Thesis (Ph.D.)--University of Washington, 2018Large-scale data analytics is key to modern science, t...
Research has shown that approximate computing is effective at reducing the resource requirements, co...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
From movie recommendations to fraud detection to personalized health care, there is growing need to ...
Modern data analytics applications typically process massive amounts of data on clusters of tens, hu...
Clustering, the task of grouping together similar items, is a frequently used method for processing ...
Data Anlaytic techniques have enhanced human ability to solve a lot of data related problems. It ha...
The ability to analyze large-scale video datasets is useful in an increasing range of applications. ...
Abstract Online analytical processing (OLAP) is a core functionality in database systems. The perfor...
A fast response is critical in many data-intensive applications, including knowledge discovery analy...
Over the past decade, the growth of data has been phenomenal. The amount of data which the world acc...
Today, we collect a large amount of data, and the volume of the data we collect is projected to grow...
We study a novel solution to executing aggregation (and specifically COUNT) queries over large-scale...
Today, most modern online services make use of big data analytics systems to extract useful informat...
Thesis (Ph.D.)--University of Washington, 2018Large-scale data analytics is key to modern science, t...
Research has shown that approximate computing is effective at reducing the resource requirements, co...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
From movie recommendations to fraud detection to personalized health care, there is growing need to ...
Modern data analytics applications typically process massive amounts of data on clusters of tens, hu...
Clustering, the task of grouping together similar items, is a frequently used method for processing ...
Data Anlaytic techniques have enhanced human ability to solve a lot of data related problems. It ha...
The ability to analyze large-scale video datasets is useful in an increasing range of applications. ...
Abstract Online analytical processing (OLAP) is a core functionality in database systems. The perfor...
A fast response is critical in many data-intensive applications, including knowledge discovery analy...
Over the past decade, the growth of data has been phenomenal. The amount of data which the world acc...
Today, we collect a large amount of data, and the volume of the data we collect is projected to grow...
We study a novel solution to executing aggregation (and specifically COUNT) queries over large-scale...