Supervised entity resolution methods rely on labeled record pairs for learning matching patterns between two or more data sources. Active learning minimizes the labeling effort by selecting informative pairs for labeling. The existing active learning methods for entity resolution all target two-source matching scenarios and ignore signals that only exist in multi-source settings, such as the Web of Data. In this paper, we propose ALMSER, a graph-boosted active learning method for multi-source entity resolution. To the best of our knowledge, ALMSER is the first active learning-based entity resolution method that is especially tailored to the multi-source setting. ALMSER exploits the rich correspondence graph that exists in multi-source sett...
In entity matching, a fundamental issue while training a classifier to label pairs of entities as ei...
© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for ...
Thesis (Ph.D.), Department of Computer Science, Washington State UniversityUsing a graph representat...
Supervised entity resolution methods rely on labeled record pairs for learning matching patterns bet...
The goal of entity resolution, also known as duplicate detection and record linkage, is to identify ...
Entity resolution is one of the central challenges when integrating data from large numbers of data ...
Entity resolution is the task of identifying records in one or more data sources which refer to the ...
Entity resolution is a key aspect of data quality, identifying which records correspond to the same ...
We consider a serious, previously-unexplored challenge facing al-most all approaches to scaling up e...
Entity resolution (ER) is the task of deciding which records in one or more databases refer to the s...
Entity Resolution refers to the process of identifying records which represent the same real-world e...
Entity Resolution is the task of identifying which records in a database refer to the same entity. A...
In this paper, we address multi-labeler active learning, where data labels can be acquired from mult...
Entity resolution (ER) seeks to identify which records in a data set refer to the same real-world en...
Entity resolution (ER) is the task of finding records that refer to the same real-world entities. A ...
In entity matching, a fundamental issue while training a classifier to label pairs of entities as ei...
© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for ...
Thesis (Ph.D.), Department of Computer Science, Washington State UniversityUsing a graph representat...
Supervised entity resolution methods rely on labeled record pairs for learning matching patterns bet...
The goal of entity resolution, also known as duplicate detection and record linkage, is to identify ...
Entity resolution is one of the central challenges when integrating data from large numbers of data ...
Entity resolution is the task of identifying records in one or more data sources which refer to the ...
Entity resolution is a key aspect of data quality, identifying which records correspond to the same ...
We consider a serious, previously-unexplored challenge facing al-most all approaches to scaling up e...
Entity resolution (ER) is the task of deciding which records in one or more databases refer to the s...
Entity Resolution refers to the process of identifying records which represent the same real-world e...
Entity Resolution is the task of identifying which records in a database refer to the same entity. A...
In this paper, we address multi-labeler active learning, where data labels can be acquired from mult...
Entity resolution (ER) seeks to identify which records in a data set refer to the same real-world en...
Entity resolution (ER) is the task of finding records that refer to the same real-world entities. A ...
In entity matching, a fundamental issue while training a classifier to label pairs of entities as ei...
© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for ...
Thesis (Ph.D.), Department of Computer Science, Washington State UniversityUsing a graph representat...