Information explosion is a problem for everyone nowadays. It is a great challenge to all kinds of businesses to maintain high quality of data in their information applications, such as data integration, text and web mining, information retrieval, search engine, etc. In such applications, matching names is one of the popular tasks. There are a number of name matching techniques available. Unfortunately, there is no existing name matching technique that performs the best in all situations. Therefore, a problem that every researcher or a practitioner has to face is how to select an appropriate technique for a given dataset. This paper analyses and evaluates a set of popular name matching techniques on several carefully designed different datas...
Identifying names --- e.g., author names or company names --- is still an open problem. In this pape...
approximately duplicate database records that refer to the same entity is essential for information ...
<div><p>Misspellings of organism scientific names create barriers to optimal storage and organizatio...
Information explosion is a problem for everyone nowadays. It is a great challenge to all kinds of bu...
Finding and matching personal names is at the core of an increasing number of applications: from tex...
Names are important in many societies, even in technologically oriented ones which use e.g. ID syste...
Name matching is a fundamental task in various domains, including data integration, record linkage, ...
Name matching—recognizing when two different strings are likely to denote the same entity—is an impo...
Approximate proper-name matching uses concepts of approximate string matching and applies them to sp...
The obvious need for using modem computer networking capabilities to enable the effective sharing of...
This paper compares several indexing methods for person names extracted from text, developed for an ...
Misspellings of organism scientific names create barriers to optimal storage and organization of bio...
The obvious need for using modern computer networking capabilities to enable the effective sharing o...
In the presence of dirty data, a search for specific information by a standard query (e.g., search f...
This paper describes the development of a ground truth dataset of culturally diverse Romanized names...
Identifying names --- e.g., author names or company names --- is still an open problem. In this pape...
approximately duplicate database records that refer to the same entity is essential for information ...
<div><p>Misspellings of organism scientific names create barriers to optimal storage and organizatio...
Information explosion is a problem for everyone nowadays. It is a great challenge to all kinds of bu...
Finding and matching personal names is at the core of an increasing number of applications: from tex...
Names are important in many societies, even in technologically oriented ones which use e.g. ID syste...
Name matching is a fundamental task in various domains, including data integration, record linkage, ...
Name matching—recognizing when two different strings are likely to denote the same entity—is an impo...
Approximate proper-name matching uses concepts of approximate string matching and applies them to sp...
The obvious need for using modem computer networking capabilities to enable the effective sharing of...
This paper compares several indexing methods for person names extracted from text, developed for an ...
Misspellings of organism scientific names create barriers to optimal storage and organization of bio...
The obvious need for using modern computer networking capabilities to enable the effective sharing o...
In the presence of dirty data, a search for specific information by a standard query (e.g., search f...
This paper describes the development of a ground truth dataset of culturally diverse Romanized names...
Identifying names --- e.g., author names or company names --- is still an open problem. In this pape...
approximately duplicate database records that refer to the same entity is essential for information ...
<div><p>Misspellings of organism scientific names create barriers to optimal storage and organizatio...