In this paper, we first employ the well known Cover-Coefficient Based Clustering Methodology (C3 M) for clustering XML documents. Next, we apply index pruning techniques from the literature to reduce the size of the document vectors. Our experiments show that for certain cases, it is possible to prune up to 70% of the collection (or, more specifically, underlying document vectors) and still generate a clustering structure that yields the same quality with that of the original collection, in terms of a set of evaluation metrics
In this paper, we study the problem of indexing an XML database. Existing XML indexing techniques fo...
AbstractClustering XML documents is extensively used to organize large collections of XML documents ...
With the standardization of XML as an information exchange language over the net, a huge amount of i...
In this paper, we first employ the well known Cover-Coefficient Based Clustering Methodology (C3M) f...
Abstract: Problem statement: To improve the performance of data retrieval in a homogeneous large XML...
In the last few years we have observed a proliferation of approaches for clustering XML docu-\ud men...
Abstract—With the standardization of XML as an information exchange language over the net, a huge am...
This paper reports on the experiments and results of a clustering approach used in the INEX 2008 doc...
With the increasing use of XML in many domains, XML document clustering has been a central research ...
Abstract. This work explores the application of clustering methods for grouping structurally similar...
In this paper, we describe a new bitmap indexing based technique to cluster XML documents. XML is a ...
@inproceedings{AI-DOUCET-2002, author = {Doucet, A. and Ahonen-Myka, H.}, title = {Naive clustering ...
With the vastly growing data resources on the Internet, XML is one of the most important standards f...
This paper presents the incremental clustering algorithm, XML documents Clustering with Level Simila...
XML has become main data format in e-Business, e-Learning, e-Commerce, the need for tools to help ma...
In this paper, we study the problem of indexing an XML database. Existing XML indexing techniques fo...
AbstractClustering XML documents is extensively used to organize large collections of XML documents ...
With the standardization of XML as an information exchange language over the net, a huge amount of i...
In this paper, we first employ the well known Cover-Coefficient Based Clustering Methodology (C3M) f...
Abstract: Problem statement: To improve the performance of data retrieval in a homogeneous large XML...
In the last few years we have observed a proliferation of approaches for clustering XML docu-\ud men...
Abstract—With the standardization of XML as an information exchange language over the net, a huge am...
This paper reports on the experiments and results of a clustering approach used in the INEX 2008 doc...
With the increasing use of XML in many domains, XML document clustering has been a central research ...
Abstract. This work explores the application of clustering methods for grouping structurally similar...
In this paper, we describe a new bitmap indexing based technique to cluster XML documents. XML is a ...
@inproceedings{AI-DOUCET-2002, author = {Doucet, A. and Ahonen-Myka, H.}, title = {Naive clustering ...
With the vastly growing data resources on the Internet, XML is one of the most important standards f...
This paper presents the incremental clustering algorithm, XML documents Clustering with Level Simila...
XML has become main data format in e-Business, e-Learning, e-Commerce, the need for tools to help ma...
In this paper, we study the problem of indexing an XML database. Existing XML indexing techniques fo...
AbstractClustering XML documents is extensively used to organize large collections of XML documents ...
With the standardization of XML as an information exchange language over the net, a huge amount of i...