The processing and management of XML data are popular research issues. However, operations based on the structure of XML data have not received strong attention. These operations involve, among others, the grouping of structurally similar XML documents. Such grouping results from the application of clustering methods with distances that estimate the similarity between tree structures. This paper presents a framework for clustering XML documents by structure. Modeling the XML documents as rooted ordered labeled trees, we study the usage of structural distance metrics in hierarchical clustering algorithms to detect groups of structurally similar XML documents. We suggest the usage of structural summaries for trees to improve the performance o...
In the last few years we have observed a proliferation of approaches for clustering XML docu-\ud men...
With the growing popularity of XML as the data representation language, collections of the XML data ...
Abstract. Every day more digital data in semi-structured format are available on the World Wide Web,...
The processing and management of XML data are popular research issues. However, operations based on ...
This work explores the application of clustering methods for grouping structurally similar XML docum...
Abstract—With the standardization of XML as an information exchange language over the net, a huge am...
This paper presents the incremental clustering algorithm, XML documents Clustering with Level Simila...
With the increasing use of XML in many domains, XML document clustering has been a central research ...
This paper proposes a clustering approach that explores both the content and the structure of XML do...
The large amount and heterogeneity of XML documents on the Web require the development of clustering...
With the standardization of XML as an information exchange language over the net, a huge amount of i...
We propose a methodology for clustering XMLdocuments on the basis of their structuralsimilarities. T...
We present a novel clustering algorithm to group the XML documents by similar structures. We introdu...
XML has become a popular method of data representation both on the web and in databases in recent ye...
With the growing popularity of XML as the data representation language, collections of the XML data ...
In the last few years we have observed a proliferation of approaches for clustering XML docu-\ud men...
With the growing popularity of XML as the data representation language, collections of the XML data ...
Abstract. Every day more digital data in semi-structured format are available on the World Wide Web,...
The processing and management of XML data are popular research issues. However, operations based on ...
This work explores the application of clustering methods for grouping structurally similar XML docum...
Abstract—With the standardization of XML as an information exchange language over the net, a huge am...
This paper presents the incremental clustering algorithm, XML documents Clustering with Level Simila...
With the increasing use of XML in many domains, XML document clustering has been a central research ...
This paper proposes a clustering approach that explores both the content and the structure of XML do...
The large amount and heterogeneity of XML documents on the Web require the development of clustering...
With the standardization of XML as an information exchange language over the net, a huge amount of i...
We propose a methodology for clustering XMLdocuments on the basis of their structuralsimilarities. T...
We present a novel clustering algorithm to group the XML documents by similar structures. We introdu...
XML has become a popular method of data representation both on the web and in databases in recent ye...
With the growing popularity of XML as the data representation language, collections of the XML data ...
In the last few years we have observed a proliferation of approaches for clustering XML docu-\ud men...
With the growing popularity of XML as the data representation language, collections of the XML data ...
Abstract. Every day more digital data in semi-structured format are available on the World Wide Web,...