Data on the Internet is increasingly presented in XML format. This allows for novel applications that query all this data using some XML query language. All XML query languages use path expressions to navigate through the tree structure of the data. Estimating the selectivity of these path expressions is therefore essential for optimizing queries in these languages. In this paper, we propose two techniques for capturing the structure of complex large-scale XML data as would be handled by Internet-scale applications in a small amount of memory for estimating the selectivity of XML path expressions: summarized path trees and summarized Markov tables. We experimentally demonstrate the accuracy of our proposed techniques, and explore the differ...
XML query languages typically allow the specification of structural patterns of elements. Finding th...
We study the applicability of XML path summaries in the context of current-day XML databases. We fin...
In this paper we present a novel approach for estimating the selectivity of XML twig queries. Such a...
Estimating the selectivity of queries is a crucial problem in database systems. Virtually all databa...
All existing proposals for querying XML (e.g., XQuery) rely on a pattern-specification language that...
As the Extensible Markup Language (XML) rapidly estab-lishes itself as the de facto standard for pre...
Estimating the selectivity of a simple path expression (SPE) is essential for selecting the most eff...
Summarization: Effective support for XML query languages is becoming increasingly important with the...
There is currently a lot of interest in developing Internet query processors that can pose elaborate...
International audienceSemi-structured data sets in the form of XML documents have many practical use...
Extensible Markup Language (XML) has become the de facto standard for data representation, exchange ...
There have been several techniques proposed for building statistics for static XML data. However, ve...
XML has emerged as a new standard for information representation and exchange on the Internet. To ef...
The adoption of XML promises to accelerate construction of systems that integrate dis-tributed, hete...
Query optimization in IBM's System RX, the first truly relational-XML hybrid data management ...
XML query languages typically allow the specification of structural patterns of elements. Finding th...
We study the applicability of XML path summaries in the context of current-day XML databases. We fin...
In this paper we present a novel approach for estimating the selectivity of XML twig queries. Such a...
Estimating the selectivity of queries is a crucial problem in database systems. Virtually all databa...
All existing proposals for querying XML (e.g., XQuery) rely on a pattern-specification language that...
As the Extensible Markup Language (XML) rapidly estab-lishes itself as the de facto standard for pre...
Estimating the selectivity of a simple path expression (SPE) is essential for selecting the most eff...
Summarization: Effective support for XML query languages is becoming increasingly important with the...
There is currently a lot of interest in developing Internet query processors that can pose elaborate...
International audienceSemi-structured data sets in the form of XML documents have many practical use...
Extensible Markup Language (XML) has become the de facto standard for data representation, exchange ...
There have been several techniques proposed for building statistics for static XML data. However, ve...
XML has emerged as a new standard for information representation and exchange on the Internet. To ef...
The adoption of XML promises to accelerate construction of systems that integrate dis-tributed, hete...
Query optimization in IBM's System RX, the first truly relational-XML hybrid data management ...
XML query languages typically allow the specification of structural patterns of elements. Finding th...
We study the applicability of XML path summaries in the context of current-day XML databases. We fin...
In this paper we present a novel approach for estimating the selectivity of XML twig queries. Such a...