Name matching—recognizing when two different strings are likely to denote the same entity—is an important task in many legal information systems, such as case-management systems. The naming conventions peculiar to legal cases limit the effectiveness of generic approximate string-matching algorithms in this task. This paper proposes a three-stage framework for name matching, identifies how each stage in the framework addresses the naming variations that typically arise in legal cases, describes several alternative approaches to each stage, and evaluates the performance of various combinations of the alternatives on a representative collection of names drawn from a United States District Court case management system. The best tradeoff between...
Identifying names --- e.g., author names or company names --- is still an open problem. In this pape...
Approximate string matching (ASM) is a challenging problem, which aims to match different string exp...
In the presence of dirty data, a search for specific information by a standard query (e.g., search f...
Names are important in many societies, even in technologically oriented ones which use e.g. ID syste...
Information explosion is a problem for everyone nowadays. It is a great challenge to all kinds of bu...
Finding and matching personal names is at the core of an increasing number of applications: from tex...
Information explosion is a problem for everyone nowadays. It is a great challenge to all kinds of bu...
Approximate proper-name matching uses concepts of approximate string matching and applies them to sp...
Viable on-line search systems require reasonable capabilities to automatically detect (and hopefully...
Name matching is a fundamental task in various domains, including data integration, record linkage, ...
Rather than collect data from a variety of surveys, it is often more efficient to merge information ...
Misspellings of organism scientific names create barriers to optimal storage and organization of bio...
<div><p>Misspellings of organism scientific names create barriers to optimal storage and organizatio...
This paper describes the development of a ground truth dataset of culturally diverse Romanized names...
In Finnish e-invoicing standards, there is a lack of granular standardization which leads to informa...
Identifying names --- e.g., author names or company names --- is still an open problem. In this pape...
Approximate string matching (ASM) is a challenging problem, which aims to match different string exp...
In the presence of dirty data, a search for specific information by a standard query (e.g., search f...
Names are important in many societies, even in technologically oriented ones which use e.g. ID syste...
Information explosion is a problem for everyone nowadays. It is a great challenge to all kinds of bu...
Finding and matching personal names is at the core of an increasing number of applications: from tex...
Information explosion is a problem for everyone nowadays. It is a great challenge to all kinds of bu...
Approximate proper-name matching uses concepts of approximate string matching and applies them to sp...
Viable on-line search systems require reasonable capabilities to automatically detect (and hopefully...
Name matching is a fundamental task in various domains, including data integration, record linkage, ...
Rather than collect data from a variety of surveys, it is often more efficient to merge information ...
Misspellings of organism scientific names create barriers to optimal storage and organization of bio...
<div><p>Misspellings of organism scientific names create barriers to optimal storage and organizatio...
This paper describes the development of a ground truth dataset of culturally diverse Romanized names...
In Finnish e-invoicing standards, there is a lack of granular standardization which leads to informa...
Identifying names --- e.g., author names or company names --- is still an open problem. In this pape...
Approximate string matching (ASM) is a challenging problem, which aims to match different string exp...
In the presence of dirty data, a search for specific information by a standard query (e.g., search f...