Maintaining strictness in dimensions is important in integration of data warehouses. A dimension that satisfies all of its roll-up constraints is said to be strict, a property that is required for correct aggregation. Existing work on instance matching does not address the problem of enforcing the strictness of roll-up constraints. In this paper, we use a graph matching-based approach to dimension instance matching and propose an algorithm that enforces strictness and reduces false positives. Making use of similarity flooding, the graph matching algorithm can be greedy in identifying matching members, we propose heuristics to further reduce false positive matches and reduce false strictness. Experiments on real-world data demonstrates the e...
Following recent trends in Data Warehousing, companies realized that there is a great potential in c...
A dimension in a data warehouse (DW) is an abstract concept that groups data that share a common sem...
Matching elements of two data schemas or two data instances plays a key role in data warehousing, e-...
The problem of integrating heterogeneous data marts is an important problemin building enterprise da...
Abstract—Dimensions in Data Warehouses (DWs) are set of elements connected by a hierarchical relatio...
Abstract—Dimensions in Data Warehouses (DWs) are usually modeled as a hierarchical set of categories...
Abstract. Data warehouses (DWs) can become inconsistent when some dimensional constraints are not sa...
A Data Warehouse (DW) is a data repository that organizes and physically integrates data from multip...
Abstract. A Data Warehouse (DW) is a data repository that organizes and phys-ically integrates data ...
Abstract. On-Line Analytical Processing (OLAP) dimensions are usually mod-elled as a hierarchical se...
On-Line Analytical Processing (OLAP) dimensions are usually modelled as a hierarchical set of catego...
Abstract. The similarity join is an important database primitive which has been successfully applied...
Record-level matching rules are chains of similarity join pred-icates on multiple attributes employe...
During the last decades, the Data Warehouse has been one of the main components of a Decision Suppor...
In this paper we address the problem of integrating independent and possibly heterogeneous data ware...
Following recent trends in Data Warehousing, companies realized that there is a great potential in c...
A dimension in a data warehouse (DW) is an abstract concept that groups data that share a common sem...
Matching elements of two data schemas or two data instances plays a key role in data warehousing, e-...
The problem of integrating heterogeneous data marts is an important problemin building enterprise da...
Abstract—Dimensions in Data Warehouses (DWs) are set of elements connected by a hierarchical relatio...
Abstract—Dimensions in Data Warehouses (DWs) are usually modeled as a hierarchical set of categories...
Abstract. Data warehouses (DWs) can become inconsistent when some dimensional constraints are not sa...
A Data Warehouse (DW) is a data repository that organizes and physically integrates data from multip...
Abstract. A Data Warehouse (DW) is a data repository that organizes and phys-ically integrates data ...
Abstract. On-Line Analytical Processing (OLAP) dimensions are usually mod-elled as a hierarchical se...
On-Line Analytical Processing (OLAP) dimensions are usually modelled as a hierarchical set of catego...
Abstract. The similarity join is an important database primitive which has been successfully applied...
Record-level matching rules are chains of similarity join pred-icates on multiple attributes employe...
During the last decades, the Data Warehouse has been one of the main components of a Decision Suppor...
In this paper we address the problem of integrating independent and possibly heterogeneous data ware...
Following recent trends in Data Warehousing, companies realized that there is a great potential in c...
A dimension in a data warehouse (DW) is an abstract concept that groups data that share a common sem...
Matching elements of two data schemas or two data instances plays a key role in data warehousing, e-...