The availability of summary data for XML documents has many applications, from providing users with quick feedback about their queries, to cost-based storage design and query optimization. StatiX is a novel XML Schema-aware statistics framework that exploits the structure derived by regular expressions (which define elements in an XML Schema) to pinpoint places in the schema that are likely sources of structural skew. As we discuss below, this information can be used to build concise, yet accurate, statistical summaries for XML data. StatiX leverages standard XML technology for gathering statistics, notably XML Schema validators, and it uses histograms to summarize both the structure and values in an XML document. In this paper we describe ...
There is currently a lot of interest in developing Internet query processors that can pose elaborate...
The nature of semistructured data in web collections is evolving. Even when XML web documents are va...
XML schema analysis aims to extract quantitative and qualitative information from actual XML schemas...
The availability of summary data for XML documents has many applications, from providing users with ...
We tackle the problem of obtaining statistics on content and structure of XML documents by using sum...
There have been several techniques proposed for building statistics for static XML data. However, ve...
In the last few years several repositories for storing XML documents and languages for querying XML...
Current approaches for estimating the cardinality of XML queries are applicable to a static scenario...
An XML summary should enable cardinality estimations of different kinds on an XML document to flexib...
Summarization: Effective support for XML query languages is becoming increasingly important with the...
The XML standards that are currently emerging have a number of characteristics that can also be foun...
Abstract. Recently XML has achieved the leading role among lan-guages for data representation and th...
Journal ArticleCurrent approaches for estimating the cardinality of XML queries are applicable to a...
In the last few years several repositories for storing XML documents and languages for querying XML ...
Summarization: XML is rapidly emerging as the new standard for data representation and exchange on t...
There is currently a lot of interest in developing Internet query processors that can pose elaborate...
The nature of semistructured data in web collections is evolving. Even when XML web documents are va...
XML schema analysis aims to extract quantitative and qualitative information from actual XML schemas...
The availability of summary data for XML documents has many applications, from providing users with ...
We tackle the problem of obtaining statistics on content and structure of XML documents by using sum...
There have been several techniques proposed for building statistics for static XML data. However, ve...
In the last few years several repositories for storing XML documents and languages for querying XML...
Current approaches for estimating the cardinality of XML queries are applicable to a static scenario...
An XML summary should enable cardinality estimations of different kinds on an XML document to flexib...
Summarization: Effective support for XML query languages is becoming increasingly important with the...
The XML standards that are currently emerging have a number of characteristics that can also be foun...
Abstract. Recently XML has achieved the leading role among lan-guages for data representation and th...
Journal ArticleCurrent approaches for estimating the cardinality of XML queries are applicable to a...
In the last few years several repositories for storing XML documents and languages for querying XML ...
Summarization: XML is rapidly emerging as the new standard for data representation and exchange on t...
There is currently a lot of interest in developing Internet query processors that can pose elaborate...
The nature of semistructured data in web collections is evolving. Even when XML web documents are va...
XML schema analysis aims to extract quantitative and qualitative information from actual XML schemas...