References are the main descriptive metadata used by digital libraries of scientific articles. These references can be represented by several formats and styles. Although considerable content variations can also occur in some metadata fields such as title, author names and publication venue. Duplicate records influence the quality of digital library services once they need to be appropriately identified and treated. This paper presents an approach to identifying duplicated bibliographic metadata. We extend our previous work so that instead of setting thresholds based on the scores returned by similarity functions, we use the scores to train classification algorithms which automatically identify duplicated references. The experiments show...
Comprehensive bibliographies often rely on community contributions. In such a setting, de-duplicatio...
The digital information age has brought with it the information seekers. These seekers, which are or...
This poster presents a set of hash keys for bibliographic records called bibkeys. Unlike other metho...
Digital libraries contain collections of digital objects, acquired from different sources, which can...
Περιέχει το πλήρες κείμενοPurpose - The purpose of this paper is to focus on duplicate record detect...
The paper describes a fault-tolerant method of selecting duplicate bibliographic records in catalogu...
Purpose - This paper aims to address the problem of enhancing the selection of titles offered by a d...
Objective: To automatically detect duplicate citations in a bibliographical database. Background: Ci...
Motivation: Duplicate publication impacts the quality of the scientific corpus, has been difficult t...
The paper proposes matching short forms (abbreviated titles from the citation report) with their cor...
Metadados bibliográficos duplicados são registros que correspondem a referências bibliográficas sema...
Metadados bibliográficos duplicados são registros que correspondem a referências bibliográficas sema...
Automatically extracted metadata from scholarly documents in PDF formats is usually noisy and hetero...
Motivation: Document similarity metrics such as PubMed’s “Find related articles ” feature, which hav...
In recent years, the Web of Science Core Collection and Scopus databases have become primary sources...
Comprehensive bibliographies often rely on community contributions. In such a setting, de-duplicatio...
The digital information age has brought with it the information seekers. These seekers, which are or...
This poster presents a set of hash keys for bibliographic records called bibkeys. Unlike other metho...
Digital libraries contain collections of digital objects, acquired from different sources, which can...
Περιέχει το πλήρες κείμενοPurpose - The purpose of this paper is to focus on duplicate record detect...
The paper describes a fault-tolerant method of selecting duplicate bibliographic records in catalogu...
Purpose - This paper aims to address the problem of enhancing the selection of titles offered by a d...
Objective: To automatically detect duplicate citations in a bibliographical database. Background: Ci...
Motivation: Duplicate publication impacts the quality of the scientific corpus, has been difficult t...
The paper proposes matching short forms (abbreviated titles from the citation report) with their cor...
Metadados bibliográficos duplicados são registros que correspondem a referências bibliográficas sema...
Metadados bibliográficos duplicados são registros que correspondem a referências bibliográficas sema...
Automatically extracted metadata from scholarly documents in PDF formats is usually noisy and hetero...
Motivation: Document similarity metrics such as PubMed’s “Find related articles ” feature, which hav...
In recent years, the Web of Science Core Collection and Scopus databases have become primary sources...
Comprehensive bibliographies often rely on community contributions. In such a setting, de-duplicatio...
The digital information age has brought with it the information seekers. These seekers, which are or...
This poster presents a set of hash keys for bibliographic records called bibkeys. Unlike other metho...