We describe a method of inferring join plans for a set of relation instances, in the absence of any metadata, such as attribute domains, attribute names, or constraints (e.g., keys or foreign keys). Our method enumerates the possible join plans in order of likelihood, based on the compatibility of a pair of columns and their suitability as join attributes (i.e. their appropriateness as keys). We outline two variants of the approach. The first variant is accurate but potentially time-consuming, especially for large relations that do not fit in memory. The second variant is an approximation of the former and hence less accurate, but is considerably more efficient, allowing the method to be used online, even for large relations. We provide exp...
National audienceSpecifying join predicates may become a cumbersome task in many situations e.g., wh...
Relational databases have been utilized in a variety of applications, such as airline reservation sy...
Machine Learning (ML) applications require high-quality datasets. Automated data augmentation techni...
We describe a method of inferring join plans for a set of relation instances, in the absence of any ...
Techniques for identifying joinable or unionable tables in data lakes can yield valuable information...
International audienceWe investigate the problem of learning join queries from user examples. The us...
Join is a powerful operator that combines records from two or more tables, which is of fundamental i...
International audienceWe investigate the problem of inferring join queries from user interactions. T...
Abstract. A join of two relations in real databases is usually much smaller than their Cartesian pro...
Two new algorithms, "Jive-join'" and "Slam-join," are proposed for computing the join of two relatio...
We investigate the problem of inferring join queries from user interactions. The user is presented w...
The join operation combines information from multiple data sources. Efficient processing of join que...
We present three novel algorithms for performing multi-dimensional joins and an in-depth survey and ...
Databases contain information about which relationships do and do not hold among entities. To make t...
In this paper, we propose a search-based approach to join two tables in the absence of clean join at...
National audienceSpecifying join predicates may become a cumbersome task in many situations e.g., wh...
Relational databases have been utilized in a variety of applications, such as airline reservation sy...
Machine Learning (ML) applications require high-quality datasets. Automated data augmentation techni...
We describe a method of inferring join plans for a set of relation instances, in the absence of any ...
Techniques for identifying joinable or unionable tables in data lakes can yield valuable information...
International audienceWe investigate the problem of learning join queries from user examples. The us...
Join is a powerful operator that combines records from two or more tables, which is of fundamental i...
International audienceWe investigate the problem of inferring join queries from user interactions. T...
Abstract. A join of two relations in real databases is usually much smaller than their Cartesian pro...
Two new algorithms, "Jive-join'" and "Slam-join," are proposed for computing the join of two relatio...
We investigate the problem of inferring join queries from user interactions. The user is presented w...
The join operation combines information from multiple data sources. Efficient processing of join que...
We present three novel algorithms for performing multi-dimensional joins and an in-depth survey and ...
Databases contain information about which relationships do and do not hold among entities. To make t...
In this paper, we propose a search-based approach to join two tables in the absence of clean join at...
National audienceSpecifying join predicates may become a cumbersome task in many situations e.g., wh...
Relational databases have been utilized in a variety of applications, such as airline reservation sy...
Machine Learning (ML) applications require high-quality datasets. Automated data augmentation techni...