Abstract Similarity between XML documents can be studied in different ways, in particular depending of the fact that we take into consideration only the structure of the document, its content, or both of them. In this paper we choose a structure-based approach, based on the fuzzyfication of the XML documents representing instances of a given XML data model, belonging to a common domain of interest. After a process of data flow analysis we obtain similarity values between documents, used to create similarity classes representing the domain. Using a clustering technique and a graph based approach it is possible build a taxonomy that can be represented through an ontology language
This paper proposes a novel approach to measuring XML document similarity by taking into account the...
This paper reports on the experiments and results of a clustering approach used in the INEX 2008 doc...
This chapter discusses existing approaches to evaluate and measure structural similarity in sources ...
In this work we propose a fuzzy technique to compare XML documents belonging to a semi-structured fl...
XML is the new standard for information exchange and retrieval. As XML material becomes more abundan...
In this paper, we describe a technique for extracting patterns to a XML data flow; then, we show how...
This paper proposes a clustering approach that explores both the content and the structure of XML do...
In this paper, we describe a technique for extracting patterns to a XML data flow; then, we show how...
The large amount and heterogeneity of XML documents on the Web require the development of clustering...
Abstract. Every day more digital data in semi-structured format are available on the World Wide Web,...
This paper presents the incremental clustering algorithm, XML documents Clustering with Level Simila...
XML has gained popularity for information representation, exchange and retrieval. As XML material be...
With the growing popularity of XML as the data representation language, collections of the XML data ...
This chapter discusses existing approaches to evaluate and measure structural similarity in sources ...
In the last few years we have observed a proliferation of approaches for clustering XML docu-\ud men...
This paper proposes a novel approach to measuring XML document similarity by taking into account the...
This paper reports on the experiments and results of a clustering approach used in the INEX 2008 doc...
This chapter discusses existing approaches to evaluate and measure structural similarity in sources ...
In this work we propose a fuzzy technique to compare XML documents belonging to a semi-structured fl...
XML is the new standard for information exchange and retrieval. As XML material becomes more abundan...
In this paper, we describe a technique for extracting patterns to a XML data flow; then, we show how...
This paper proposes a clustering approach that explores both the content and the structure of XML do...
In this paper, we describe a technique for extracting patterns to a XML data flow; then, we show how...
The large amount and heterogeneity of XML documents on the Web require the development of clustering...
Abstract. Every day more digital data in semi-structured format are available on the World Wide Web,...
This paper presents the incremental clustering algorithm, XML documents Clustering with Level Simila...
XML has gained popularity for information representation, exchange and retrieval. As XML material be...
With the growing popularity of XML as the data representation language, collections of the XML data ...
This chapter discusses existing approaches to evaluate and measure structural similarity in sources ...
In the last few years we have observed a proliferation of approaches for clustering XML docu-\ud men...
This paper proposes a novel approach to measuring XML document similarity by taking into account the...
This paper reports on the experiments and results of a clustering approach used in the INEX 2008 doc...
This chapter discusses existing approaches to evaluate and measure structural similarity in sources ...