Reference reconciliation is the problem of identifying when di®erent references (i.e., sets of attribute values) in a dataset correspond to the same real-world entity. Most previous lit-erature assumed references to a single class that had a fair number of attributes (e.g., research publications). We con-sider complex information spaces: our references belong to multiple related classes and each reference may have very few attribute values. A prime example of such a space is Per-sonal Information Management, where the goal is to provide a coherent view of all the information on one's desktop. Our reconciliation algorithm has three principal features. First, we exploit the associations between references to de-sign new methods for refer...
All responsible academic libraries record their reference transactions. It is good practice to know ...
Fostering both the creation and the linking of data with the scope of supporting the growth of the L...
Identity uncertainty is a pervasive problem in real-world data analysis. It arises whenever objects ...
Data integration systems aims at facilitating the management of heterogeneous data sources. When hug...
International audienceThe reference reconciliation problem consists in decid- ing whether different ...
International audienceThe reference reconciliation problem consists in deciding whether different id...
Comprehensive bibliographies often rely on community contributions. In such a setting, de-duplicatio...
International audienceThe reference reconciliation problem consists of deciding whether different id...
This report examines what is involved when a speaker overtly selects one formulation over another by...
Information overload is a common symptom in the Internet age nowadays. Search engines assist users t...
In comparing reference unification to reference linking, the authors found that reference linking yi...
One foundational goal of artificial intelligence is to build intelligent agents which interact with ...
In this paper we address the problem of data cleaning when multiple data sources are merged to creat...
International audienceThe reference reconciliation problem consists of deciding whether different id...
The paper deals with the analysis of reference based on the theory of Halliday that can be analyzed ...
All responsible academic libraries record their reference transactions. It is good practice to know ...
Fostering both the creation and the linking of data with the scope of supporting the growth of the L...
Identity uncertainty is a pervasive problem in real-world data analysis. It arises whenever objects ...
Data integration systems aims at facilitating the management of heterogeneous data sources. When hug...
International audienceThe reference reconciliation problem consists in decid- ing whether different ...
International audienceThe reference reconciliation problem consists in deciding whether different id...
Comprehensive bibliographies often rely on community contributions. In such a setting, de-duplicatio...
International audienceThe reference reconciliation problem consists of deciding whether different id...
This report examines what is involved when a speaker overtly selects one formulation over another by...
Information overload is a common symptom in the Internet age nowadays. Search engines assist users t...
In comparing reference unification to reference linking, the authors found that reference linking yi...
One foundational goal of artificial intelligence is to build intelligent agents which interact with ...
In this paper we address the problem of data cleaning when multiple data sources are merged to creat...
International audienceThe reference reconciliation problem consists of deciding whether different id...
The paper deals with the analysis of reference based on the theory of Halliday that can be analyzed ...
All responsible academic libraries record their reference transactions. It is good practice to know ...
Fostering both the creation and the linking of data with the scope of supporting the growth of the L...
Identity uncertainty is a pervasive problem in real-world data analysis. It arises whenever objects ...