In this paper, we first employ the well known Cover-Coefficient Based Clustering Methodology (C3M) for clustering XML documents. Next, we apply index pruning techniques from the literature to reduce the size of the document vectors. Our experiments show that for certain cases, it is possible to prune up to 70% of the collection (or, more specifically, underlying document vectors) and still generate a clustering structure that yields the same quality with that of the original collection, in terms of a set of evaluation metrics. © 2010 Springer-Verlag Berlin Heidelberg
This paper presents the incremental clustering algorithm, XML documents Clustering with Level Simila...
In this paper, we study the problem of indexing an XML database. Existing XML indexing techniques fo...
International audienceThis article is a report concerning the two years of the XML Mining track at I...
In this paper, we first employ the well known Cover-Coefficient Based Clustering Methodology (C3 M) ...
Abstract: Problem statement: To improve the performance of data retrieval in a homogeneous large XML...
In the last few years we have observed a proliferation of approaches for clustering XML docu- ments ...
This paper describes the approach taken to the XML Mining track at INEX 2008 by a group at the Queen...
In the last few years we have observed a proliferation of approaches for clustering XML documents an...
This paper reports on the experiments and results of a clustering approach used in the INEX 2008 doc...
With the standardization of XML as an information exchange language over the net, a huge amount of i...
Large volume of information is stored in XML format in the Web, and clustering is a management metho...
Abstract—With the standardization of XML as an information exchange language over the net, a huge am...
With the increasing use of XML in many domains, XML document clustering has been a central research ...
This paper proposes a clustering approach that explores both the content and the structure of XML do...
International audienceThis paper reports our experiments carried out for the INEX XML Mining track, ...
This paper presents the incremental clustering algorithm, XML documents Clustering with Level Simila...
In this paper, we study the problem of indexing an XML database. Existing XML indexing techniques fo...
International audienceThis article is a report concerning the two years of the XML Mining track at I...
In this paper, we first employ the well known Cover-Coefficient Based Clustering Methodology (C3 M) ...
Abstract: Problem statement: To improve the performance of data retrieval in a homogeneous large XML...
In the last few years we have observed a proliferation of approaches for clustering XML docu- ments ...
This paper describes the approach taken to the XML Mining track at INEX 2008 by a group at the Queen...
In the last few years we have observed a proliferation of approaches for clustering XML documents an...
This paper reports on the experiments and results of a clustering approach used in the INEX 2008 doc...
With the standardization of XML as an information exchange language over the net, a huge amount of i...
Large volume of information is stored in XML format in the Web, and clustering is a management metho...
Abstract—With the standardization of XML as an information exchange language over the net, a huge am...
With the increasing use of XML in many domains, XML document clustering has been a central research ...
This paper proposes a clustering approach that explores both the content and the structure of XML do...
International audienceThis paper reports our experiments carried out for the INEX XML Mining track, ...
This paper presents the incremental clustering algorithm, XML documents Clustering with Level Simila...
In this paper, we study the problem of indexing an XML database. Existing XML indexing techniques fo...
International audienceThis article is a report concerning the two years of the XML Mining track at I...