Abstract. Link discovery is the problem of linking entities between two or more datasets, based on some (possibly unknown) specification. A blocking scheme is a one-to-many mapping from entities to blocks. Blocking methods avoid O(n^2) comparisons by clustering entities into blocks, and limiting the evaluation of link specifications to entity pairs within blocks. Current link-discovery blocking methods explicitly assume that two RDF datasets are provided as input, and need to be linked. In this paper, we assume instead that two heterogeneous dataset collections, comprising arbitrary numbers of RDF and tabular datasets, are provided as input. We show that data model heterogeneity can be addressed by representing RDF datasets as property table...
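The blocking idea described in the abstract can be illustrated with a minimal sketch. It is not the paper's blocking scheme: it assumes a simple token-based scheme in which every lowercase whitespace-separated token of an entity's values names a block, so each entity maps to several blocks (a one-to-many mapping). The function names `token_blocks` and `candidate_pairs` and the sample entities are hypothetical, introduced only for illustration.

```python
from collections import defaultdict
from itertools import combinations


def token_blocks(entities):
    """Map each lowercase token to the set of entity ids whose values contain it.

    Assumption: `entities` is a dict from entity id to a list of string values.
    """
    blocks = defaultdict(set)
    for entity_id, values in entities.items():
        for value in values:
            for token in value.lower().split():
                blocks[token].add(entity_id)
    return blocks


def candidate_pairs(blocks):
    """Collect the unordered entity pairs that share at least one block."""
    pairs = set()
    for members in blocks.values():
        # Only entities inside the same block are ever compared.
        for a, b in combinations(sorted(members), 2):
            pairs.add((a, b))
    return pairs


# Hypothetical sample data, not taken from the paper.
entities = {
    "e1": ["Alan Turing"],
    "e2": ["A. Turing"],
    "e3": ["Grace Hopper"],
    "e4": ["Hopper, Grace"],
}

blocks = token_blocks(entities)
pairs = candidate_pairs(blocks)
all_pairs = len(entities) * (len(entities) - 1) // 2
print(f"{len(pairs)} candidate pairs instead of {all_pairs}")
```

In this sketch a link specification would only be evaluated on the pairs returned by `candidate_pairs`, rather than on every pair of entities. A real blocking scheme would typically normalize punctuation and bound block sizes, which this example deliberately omits.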
It is a trend to publish RDF data on the web, so that users can share ...
Many techniques were recently proposed to automate the linkage of RDF datasets...
A link key between two RDF datasets D1 and D2 is a set of pairs of properties a...
Establishing identity links across RDF datasets is a central and challenging t...
Record linkage, referred to also as entity resolution, is the process of identifying pairs of record...
In this paper, we are interested in the discovery of link keys among two diffe...
Nowadays, data integration must often manage noisy data, also containing attribute values written in...
A link key between two RDF datasets D1 and D2 is a set of pairs of properties ...
Thanks to the initiative of Linked Open Data, the RDF datasets that are publis...
Abstract. As Linked Open Data is gaining traction, publishers incorporate more their data to the cl...
Abstract. Links between heterogeneous data sets may be found by using a generalisation of keys in d...
By specifying that published datasets must link to other existing datasets, the 4th linked data prin...
Collective entity linking is a core natural language processing task, which co...