XML is a rather verbose representation of semistructured data, which may require huge amounts of storage space. We propose several summarized representations of XML data, which can both provide succinct information and be directly queried. These representations are based on the extraction of association rules from XML datasets
Due to the lack of efficient native XML database man-agement systems, XML data manipulation and quer...
Extracting information from semistructured documents is a very hard task, and is going to become mor...
The increasing amount of XML datasets available to casual users increases the necessity of investiga...
In the last years XML is becoming a standardized means for representing semistructured data, i.e. da...
XML is a rather verbose representation of semistructured data, which may require huge amounts of sto...
With the sheer amount of data stored, presented and exchanged using XML nowadays, the ability to ext...
In the last few years several repositories for storing XML documents and languages for querying XML ...
XML is a rather verbose representation of semistructured data, which may require huge amounts of sto...
measures, performance measures Data mining algorithms are designed to extract interesting informatio...
In the XML data context, documents may have a "similar" content but a different structure, thus a fl...
The increasing amount of very large XML datasets available to casual users is a challenging problem ...
XML-enabled association rule framework [FDWC03] extends the notion of associated items to XML fragme...
In this work we describe the TreeRuler tool, which makes it possible for inexperienced users to acce...
The use of eXtensible Markup Language (XML) in web, business and scientific databases lead to the de...
We tackle the problem of obtaining statistics on content and structure of XML documents by using sum...
Due to the lack of efficient native XML database man-agement systems, XML data manipulation and quer...
Extracting information from semistructured documents is a very hard task, and is going to become mor...
The increasing amount of XML datasets available to casual users increases the necessity of investiga...
In the last years XML is becoming a standardized means for representing semistructured data, i.e. da...
XML is a rather verbose representation of semistructured data, which may require huge amounts of sto...
With the sheer amount of data stored, presented and exchanged using XML nowadays, the ability to ext...
In the last few years several repositories for storing XML documents and languages for querying XML ...
XML is a rather verbose representation of semistructured data, which may require huge amounts of sto...
measures, performance measures Data mining algorithms are designed to extract interesting informatio...
In the XML data context, documents may have a "similar" content but a different structure, thus a fl...
The increasing amount of very large XML datasets available to casual users is a challenging problem ...
XML-enabled association rule framework [FDWC03] extends the notion of associated items to XML fragme...
In this work we describe the TreeRuler tool, which makes it possible for inexperienced users to acce...
The use of eXtensible Markup Language (XML) in web, business and scientific databases lead to the de...
We tackle the problem of obtaining statistics on content and structure of XML documents by using sum...
Due to the lack of efficient native XML database man-agement systems, XML data manipulation and quer...
Extracting information from semistructured documents is a very hard task, and is going to become mor...
The increasing amount of XML datasets available to casual users increases the necessity of investiga...