This paper introduces LMFAO (Layered Multiple Functional Aggregate Optimization), an in-memory optimization and execution engine for batches of aggregates over the input database. The primary motivation for this work stems from the observation that for a variety of analytics over databases, their data-intensive tasks can be decomposed into group-by aggregates over the join of the input database relations. We exemplify the versatility and competitiveness of LMFAO for a handful of widely used analytics: learning ridge linear regression, classification trees, regression trees, and the structure of Bayesian networks using Chow-Liu trees; and data cubes used for exploration in data warehousing. LMFAO consists of several layers of logical and cod...
Model calibration is a major challenge faced by the plethora of statistical analytics packages that ...
We study a novel solution to executing aggregation (and specifically COUNT) queries over large-scale...
Data Anlaytic techniques have enhanced human ability to solve a lot of data related problems. It ha...
This paper introduces LMFAO (Layered Multiple Functional Aggregate Optimization), an in-memory optim...
We consider the problem of computing machine learning models over multi-relational databases. The ma...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Our goal is to enhance multidimensional database systems with advanced mining primitives. Current On...
Artificial Intelligence workloads have grown in popularity over the last decade, but database query ...
The integration of computers into many facets of our lives has made the collection and storage of st...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Aggregations help computing summaries of a data set, which are ubiquitous in various big data analyt...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Aggregates are rife in real life SQL queries. However, in the parallel query processing literature a...
Analytical queries virtually always involve aggregation and statistics. SQL offers a wide range of f...
Thesis (Ph.D.)--University of Washington, 2018Large-scale data analytics is key to modern science, t...
Model calibration is a major challenge faced by the plethora of statistical analytics packages that ...
We study a novel solution to executing aggregation (and specifically COUNT) queries over large-scale...
Data Anlaytic techniques have enhanced human ability to solve a lot of data related problems. It ha...
This paper introduces LMFAO (Layered Multiple Functional Aggregate Optimization), an in-memory optim...
We consider the problem of computing machine learning models over multi-relational databases. The ma...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Our goal is to enhance multidimensional database systems with advanced mining primitives. Current On...
Artificial Intelligence workloads have grown in popularity over the last decade, but database query ...
The integration of computers into many facets of our lives has made the collection and storage of st...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Aggregations help computing summaries of a data set, which are ubiquitous in various big data analyt...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Aggregates are rife in real life SQL queries. However, in the parallel query processing literature a...
Analytical queries virtually always involve aggregation and statistics. SQL offers a wide range of f...
Thesis (Ph.D.)--University of Washington, 2018Large-scale data analytics is key to modern science, t...
Model calibration is a major challenge faced by the plethora of statistical analytics packages that ...
We study a novel solution to executing aggregation (and specifically COUNT) queries over large-scale...
Data Anlaytic techniques have enhanced human ability to solve a lot of data related problems. It ha...