The WWW contains a huge amount of documents. Some of them share the subject, but are generated by different people or even organizations. To guarantee the interchange of such documents, we can use XML. This allows to share documents that do not have the same structure. However, it makes difficult to understand the core of such heterogeneous documents (in general, schema is not available). In this paper, we offer a characterization and algorithm to obtain the midpoint (in terms of a resemblance function) of a set of semi-structured, heterogeneous documents without optional elements. The trivial case of midpoint would be the common elements to all documents. Nevertheless, in cases with several heterogeneous documents this may result in an emp...
Currently, XML is still more and more important format for storing and exchanging data. Evaluation o...
This chapter discusses existing approaches to evaluate and measure structural similarity in sources ...
Due to the heterogeneous nature of XML data for internet applications exact matching of queries is o...
The WWW contains a huge amount of documents. Some of them share the subject, but are generated by di...
The WWW contains a huge amount of documents. Some of them share the subject, but are generated by di...
International audienceThe WWW contains a huge amount of documents. Some of them share the subject, b...
The WWW contains a huge amount of documents. Some of them share the same subject, but are generated ...
Sources of XML documents are proliferating on the Web and documents are more and more frequently exc...
In this paper we propose a matching algorithm for measuring the structural similarity between an XML...
The International Conference on Computational Science, ICCS 2003, Melbourne, Australia and St. Peter...
The automatic processing and management of XML-based data are ever more popular research issues due ...
The integration of distributed, heterogeneous information sources has been the topic of intense inve...
Measuring the structural similarity between an XML document and a DTD has many relevant applications...
While the world is witnessing an information revolution unprecedented and great speed in the growth ...
In this paper, we study the problem of measur-ing structural similarities of large number of source ...
Currently, XML is still more and more important format for storing and exchanging data. Evaluation o...
This chapter discusses existing approaches to evaluate and measure structural similarity in sources ...
Due to the heterogeneous nature of XML data for internet applications exact matching of queries is o...
The WWW contains a huge amount of documents. Some of them share the subject, but are generated by di...
The WWW contains a huge amount of documents. Some of them share the subject, but are generated by di...
International audienceThe WWW contains a huge amount of documents. Some of them share the subject, b...
The WWW contains a huge amount of documents. Some of them share the same subject, but are generated ...
Sources of XML documents are proliferating on the Web and documents are more and more frequently exc...
In this paper we propose a matching algorithm for measuring the structural similarity between an XML...
The International Conference on Computational Science, ICCS 2003, Melbourne, Australia and St. Peter...
The automatic processing and management of XML-based data are ever more popular research issues due ...
The integration of distributed, heterogeneous information sources has been the topic of intense inve...
Measuring the structural similarity between an XML document and a DTD has many relevant applications...
While the world is witnessing an information revolution unprecedented and great speed in the growth ...
In this paper, we study the problem of measur-ing structural similarities of large number of source ...
Currently, XML is still more and more important format for storing and exchanging data. Evaluation o...
This chapter discusses existing approaches to evaluate and measure structural similarity in sources ...
Due to the heterogeneous nature of XML data for internet applications exact matching of queries is o...