Materialized views (MVs), stored pre-computed results, are widely used to facilitate fast queries on large datasets. When new records arrive at a high rate, it is infeasible to continuously update (main-tain) MVs and a common solution is to defer maintenance by batch-ing updates together. Between batches the MVs become increas-ingly stale with incorrect, missing, and superfluous rows leading to increasingly inaccurate query results. We propose Stale View Cleaning (SVC) which addresses this problem from a data clean-ing perspective. In SVC, we efficiently clean a sample of rows from a stale MV, and use the clean sample to estimate aggregate query results. While approximate, the estimated query results reflect the most recent data. As samplin...
Updating the materialized views stored in data warehouses usually implies making the warehouse unava...
Materialized views and view maintenance are becoming increasingly important in practice. In order to...
We consider the problem of answering queries using only materialized views. We rst show that if the ...
An important obstacle to accurate data analytics is dirty data in the form of missing, duplicate, in...
Materialized views have been found to be very effective at speeding up queries, and are increasingly...
In emerging Big Data scenarios, obtaining timely, high-quality answers to aggregate queries is diffi...
Semistructured data is not strictly typed like relational or object-oriented data and may be irregul...
Semistructured data is not strictly typed like relational or object-oriented data and may be irregul...
Abstract—Materialized view is used in large data centric applications to expedite query processing. ...
We address the problem of incrementally maintaining a materialized view using the view instance and ...
Data warehousing and on-line analytical processing (OLAP) are essential elements for decision suppor...
We propose a new type of materialized view called a partially materialized view. A partially mate-ri...
Organizations collect a substantial amount of user' data from multiple sources to explore such data ...
Incomplete data is ubiquitous. When a user issues a query over incomplete data, the results may cont...
We consider the problem of speeding up the incremental maintenance of materialized views de ned by c...
Updating the materialized views stored in data warehouses usually implies making the warehouse unava...
Materialized views and view maintenance are becoming increasingly important in practice. In order to...
We consider the problem of answering queries using only materialized views. We rst show that if the ...
An important obstacle to accurate data analytics is dirty data in the form of missing, duplicate, in...
Materialized views have been found to be very effective at speeding up queries, and are increasingly...
In emerging Big Data scenarios, obtaining timely, high-quality answers to aggregate queries is diffi...
Semistructured data is not strictly typed like relational or object-oriented data and may be irregul...
Semistructured data is not strictly typed like relational or object-oriented data and may be irregul...
Abstract—Materialized view is used in large data centric applications to expedite query processing. ...
We address the problem of incrementally maintaining a materialized view using the view instance and ...
Data warehousing and on-line analytical processing (OLAP) are essential elements for decision suppor...
We propose a new type of materialized view called a partially materialized view. A partially mate-ri...
Organizations collect a substantial amount of user' data from multiple sources to explore such data ...
Incomplete data is ubiquitous. When a user issues a query over incomplete data, the results may cont...
We consider the problem of speeding up the incremental maintenance of materialized views de ned by c...
Updating the materialized views stored in data warehouses usually implies making the warehouse unava...
Materialized views and view maintenance are becoming increasingly important in practice. In order to...
We consider the problem of answering queries using only materialized views. We rst show that if the ...