This dissertation studies selectivity estimation for approximate predicates on text. Intuitively, we aim to count the number of strings that are similar to a given query string. This problem is crucial for handling text in RDBMSs in an error-tolerant way. A common difficulty with textual data is that it may contain typographical errors or use similar but different textual representations for the same real-world entity. To handle such data in databases, approximate text processing has attracted extensive interest, and commercial databases have begun to incorporate such functionality. One of the key components in successfully integrating approximate text processing into RDBMSs is the selectivity estimation module, which is cent...
String data is ubiquitous, and its management has taken on particular importance in the past few yea...
Accurate cost and time estimation of a query is one of the major success indicators for database man...
We investigate the problem of learning join queries from user examples. The us...
Declarative data quality has been an active research topic. The fundamental principle behind a decla...
Approximate predicates can be used to reduce the number of comparisons made by expensive, complex pr...
Join techniques deploying approximate match predicates are fundamental data cleaning operations. A v...
Selectivity estimation refers to the ability of the SQL query optimizer to estimate the size of the ...
Many application scenarios, e.g., marketing analysis, sensor networks, and medical and biological ap...
Many application scenarios can significantly benefit from the identification and processing...
Similarity joins are troublesome database operators that often produce results much larger than the ...
Approximate query answering relies on a similarity measure that evaluates the relevance, for a give...
Approximate queries on string data are important, due to the prevalence of such data in databases an...
The text similarity join operator joins two relations if their join attributes are textually s...