Many recent works on Entity Resolution (ER) leverage Deep Learning techniques involving language models to improve effectiveness. This is applied to both main steps of ER, i.e., blocking and matching. Several pre-trained embeddings have been tested, with the most popular ones being fastText and variants of the BERT model. However, there is no detailed analysis of their pros and cons. To cover this gap, we perform a thorough experimental analysis of 12 popular language models over 17 established benchmark datasets. First, we assess their vectorization overhead for converting all input entities into dense embeddings vectors. Second, we investigate their blocking performance, performing a detailed scalability analysis, and comparing them with ...
Nowadays, data integration must often manage noisy data, also containing attribute values written in...
Entity Resolution is the task of identifying pairs of entity profiles that represent the same real-w...
International audienceEntity resolution aims to identify descriptions of the same entity within or a...
Entity Resolution (ER), a core task of Data Integration, detects different entity profiles that corr...
BERT has set a new state-of-the-art performance on entity resolution (ER) task, largely owed to fine...
Entity Resolution (ER) lies at the core of data integration, with a bulk of research focusing on its...
Entity Resolution (ER) is the problem of matching the records that refer to the same entity within o...
Entity resolution (ER) aims at matching records that refer to the same real-world entity. Although ...
Entity Resolution (ER) is a fundamental task of data integration: it identifies different representa...
Entity Resolution (ER) seeks to understand which records refer to the same entity (e.g., matching pr...
Abstract—In the Web of data, entities are described by inter-linked data rather than documents on th...
Our research focuses on three sub-tasks of entity analysis: fine-grained entity typing (FGET), entit...
Entity matching (EM) finds data instances that refer to the same real-world entity. In this thesis w...
International audience—In the Web of data, entities are described by inter-linked data rather than d...
In this paper, we propose a semantic-aware blocking framework for entity resolution (ER). The propos...
Nowadays, data integration must often manage noisy data, also containing attribute values written in...
Entity Resolution is the task of identifying pairs of entity profiles that represent the same real-w...
International audienceEntity resolution aims to identify descriptions of the same entity within or a...
Entity Resolution (ER), a core task of Data Integration, detects different entity profiles that corr...
BERT has set a new state-of-the-art performance on entity resolution (ER) task, largely owed to fine...
Entity Resolution (ER) lies at the core of data integration, with a bulk of research focusing on its...
Entity Resolution (ER) is the problem of matching the records that refer to the same entity within o...
Entity resolution (ER) aims at matching records that refer to the same real-world entity. Although ...
Entity Resolution (ER) is a fundamental task of data integration: it identifies different representa...
Entity Resolution (ER) seeks to understand which records refer to the same entity (e.g., matching pr...
Abstract—In the Web of data, entities are described by inter-linked data rather than documents on th...
Our research focuses on three sub-tasks of entity analysis: fine-grained entity typing (FGET), entit...
Entity matching (EM) finds data instances that refer to the same real-world entity. In this thesis w...
International audience—In the Web of data, entities are described by inter-linked data rather than d...
In this paper, we propose a semantic-aware blocking framework for entity resolution (ER). The propos...
Nowadays, data integration must often manage noisy data, also containing attribute values written in...
Entity Resolution is the task of identifying pairs of entity profiles that represent the same real-w...
International audienceEntity resolution aims to identify descriptions of the same entity within or a...