International audienceWe address the problem of learning to map automatically flat and semi-structured documents onto a mediated target XML schema. This problem is motivated by the recent development of applications for searching and mining semi-structured document sources and corpora. Academic research has mainly dealt with homogeneous collections. In practical applications, data come from multiple heterogeneous sources and mining such collections requires defining a mapping or correspondence between the different document formats. Automating the design of such mappings has rapidly become a key issue for these applications. We propose a machine learning approach to this problem where the mapping is learned from pairs of input and correspon...
The increasing amount of XML datasets available to casual users increases the necessity of investiga...
This dissertation studies structured document content reuse problem. In structured document content ...
The Graph Neural Network is a relatively new machine learning method capable of encoding data as wel...
http://www.springerlink.com/International audienceWe address the problem of structure mapping that a...
XML is becoming increasingly popular as a language for representing many types of electronic documen...
Abstract: In this paper we present a system which automatically converts text documents into XML by ...
In this paper we present a novel system for automatically marking up text documents into XML. The sy...
Abstract. The interoperability of heterogeneous data sources is an important is-sue in many applicat...
Self-Organizing Maps capable of encoding structured information will be used for the clustering of X...
Self-Organizing Maps capable of encoding structured information will be used for the clustering of X...
The number of XML documents produced and available on the Internet is steadily increasing. It is thu...
In this paper we present a novel system which automatically converts text documents into XML by extr...
from structured documents. Abstract: Most existing methods for ontology learning from textual docume...
XML schema mappings have been developed and studied in the context of XML data exchange, where a sou...
In the present work we explore application of XML schema mapping in conceptual modeling of XML schem...
The increasing amount of XML datasets available to casual users increases the necessity of investiga...
This dissertation studies structured document content reuse problem. In structured document content ...
The Graph Neural Network is a relatively new machine learning method capable of encoding data as wel...
http://www.springerlink.com/International audienceWe address the problem of structure mapping that a...
XML is becoming increasingly popular as a language for representing many types of electronic documen...
Abstract: In this paper we present a system which automatically converts text documents into XML by ...
In this paper we present a novel system for automatically marking up text documents into XML. The sy...
Abstract. The interoperability of heterogeneous data sources is an important is-sue in many applicat...
Self-Organizing Maps capable of encoding structured information will be used for the clustering of X...
Self-Organizing Maps capable of encoding structured information will be used for the clustering of X...
The number of XML documents produced and available on the Internet is steadily increasing. It is thu...
In this paper we present a novel system which automatically converts text documents into XML by extr...
from structured documents. Abstract: Most existing methods for ontology learning from textual docume...
XML schema mappings have been developed and studied in the context of XML data exchange, where a sou...
In the present work we explore application of XML schema mapping in conceptual modeling of XML schem...
The increasing amount of XML datasets available to casual users increases the necessity of investiga...
This dissertation studies structured document content reuse problem. In structured document content ...
The Graph Neural Network is a relatively new machine learning method capable of encoding data as wel...