XML is becoming increasingly popular as a language for representing many types of electronic documents. The consequence of the strict structural document description via XML is that a relatively new task in mining documents based on structural and/or content information has emerged. In this paper we investigate (1) the suitability of new unsupervised machine learning methods for the clustering task of XML documents, and (2) the importance of contextual information for the same task. These tasks are part of an international competition on XML clustering and categorization (INEX 2006). It will be shown that the proposed approaches provide a suitable tool for the clustering of structured data as they yield the best results in the international...
This paper presents the incremental clustering algorithm, XML documents Clustering with Level Simila...
XML is the new standard for information exchange and retrieval. As XML material becomes more abundan...
The processing and management of XML data are popular research issues. However, operations based on ...
XML is becoming increasingly popular as a language for representing many types of electronic documen...
The number of XML documents produced and available on the Internet is steadily increasing. It is thu...
This article is a report concerning the two years of the XML Mining track at INEX (2005 and 2006). W...
Self-Organizing Maps capable of encoding structured information will be used for the clustering of X...
Self-Organizing Maps capable of encoding structured information will be used for the clustering of X...
With the increasing use of XML in many domains, XML document clustering has been a central research ...
Abstract—With the standardization of XML as an information exchange language over the net, a huge am...
International audienceThis article is a report concerning the two years of the XML Mining track at I...
This paper reports on the experiments and results of a clustering approach used in the INEX 2008 doc...
XML has become a popular method of data representation both on the web and in databases in recent ye...
The XML Document Mining track was launched for exploring two main ideas: (1) identifying key problem...
XML is touted as the breakthrough in data exchange on the web. As XML material becomes more abundant...
This paper presents the incremental clustering algorithm, XML documents Clustering with Level Simila...
XML is the new standard for information exchange and retrieval. As XML material becomes more abundan...
The processing and management of XML data are popular research issues. However, operations based on ...
XML is becoming increasingly popular as a language for representing many types of electronic documen...
The number of XML documents produced and available on the Internet is steadily increasing. It is thu...
This article is a report concerning the two years of the XML Mining track at INEX (2005 and 2006). W...
Self-Organizing Maps capable of encoding structured information will be used for the clustering of X...
Self-Organizing Maps capable of encoding structured information will be used for the clustering of X...
With the increasing use of XML in many domains, XML document clustering has been a central research ...
Abstract—With the standardization of XML as an information exchange language over the net, a huge am...
International audienceThis article is a report concerning the two years of the XML Mining track at I...
This paper reports on the experiments and results of a clustering approach used in the INEX 2008 doc...
XML has become a popular method of data representation both on the web and in databases in recent ye...
The XML Document Mining track was launched for exploring two main ideas: (1) identifying key problem...
XML is touted as the breakthrough in data exchange on the web. As XML material becomes more abundant...
This paper presents the incremental clustering algorithm, XML documents Clustering with Level Simila...
XML is the new standard for information exchange and retrieval. As XML material becomes more abundan...
The processing and management of XML data are popular research issues. However, operations based on ...