Abstract. The materialized view selection is a non-trivial task. Hence, its complexity must be reduced. A judicious choice of views must be cost-driven and influenced by the workload experienced by the system. In this paper, we propose a framework for materialized view selection that exploits a data mining technique (clustering), in order to determine clusters of similar queries. We also propose a view merging process that builds a set of candidate views, as well as a greedy process for selecting a set of views to materialize. This selection is based on cost models that evaluate the cost of accessing data using views and the cost of storing these views. To validate our strategy, we executed a workload of decision-support queries on a test d...
In this paper, we describe the design of a data warehousing system for an engineering company ‘R’. T...
In this paper we propose an approach, which is based on Markov Chain, to cluster and recommend candi...
Data warehouse is a repository of large amount of data collected from multiple heterogeneous and dis...
International audienceMaterialized views and indexes are physical structures for accelerating data a...
This is a draft of my contribution to a book chapter (doi: 10.4018/978-1-60566-816-1.ch005). We main...
Abstract. Queries to data warehouses often involve hundreds of complex aggregations over large volum...
Selecting views to materialize impacts on the efficiency as well as the total cost of establishing a...
A data warehouse uses multiple materialized views to efficiently process a given set of queries. Mat...
A data warehouse efficiently processes a given set of queries by utilizing the multiple materialized...
A data warehouse uses multiple materialized views to efficiently process a given set of queries. The...
International audienceThe aim of this article is to present an overview of the major families of sta...
Decision support systems issue a large number of online analytical processing (OLAP) queries to acce...
The use of materialized views in a data warehouse installation is a common tool to speed up mostly a...
International audienceThere are many motivations for investigating the view selection problem. At fi...
In order to facilitate query processing, the information contained in data warehouses is typically s...
In this paper, we describe the design of a data warehousing system for an engineering company ‘R’. T...
In this paper we propose an approach, which is based on Markov Chain, to cluster and recommend candi...
Data warehouse is a repository of large amount of data collected from multiple heterogeneous and dis...
International audienceMaterialized views and indexes are physical structures for accelerating data a...
This is a draft of my contribution to a book chapter (doi: 10.4018/978-1-60566-816-1.ch005). We main...
Abstract. Queries to data warehouses often involve hundreds of complex aggregations over large volum...
Selecting views to materialize impacts on the efficiency as well as the total cost of establishing a...
A data warehouse uses multiple materialized views to efficiently process a given set of queries. Mat...
A data warehouse efficiently processes a given set of queries by utilizing the multiple materialized...
A data warehouse uses multiple materialized views to efficiently process a given set of queries. The...
International audienceThe aim of this article is to present an overview of the major families of sta...
Decision support systems issue a large number of online analytical processing (OLAP) queries to acce...
The use of materialized views in a data warehouse installation is a common tool to speed up mostly a...
International audienceThere are many motivations for investigating the view selection problem. At fi...
In order to facilitate query processing, the information contained in data warehouses is typically s...
In this paper, we describe the design of a data warehousing system for an engineering company ‘R’. T...
In this paper we propose an approach, which is based on Markov Chain, to cluster and recommend candi...
Data warehouse is a repository of large amount of data collected from multiple heterogeneous and dis...