We present a novel algorithm and validation method for disambiguating author names in very large bibliographic data sets and apply it to the full Web of Science (WoS) citation index. Our algorithm relies only upon the author and citation graphs available for the whole period covered by the WoS. A pair-wise publication similarity metric, which is based on common co-authors, self-citations, shared references and citations, is established to perform a two-step agglomerative clustering that first connects individual papers and then merges similar clusters. This parameterized model is optimized using an h-index based recall measure, favoring the correct assignment of well-cited publications, and a name-initials-based precision using WoS metadata...
Abstract—Person name disambiguation is essential to dis-tinguish between persons that share the same...
peer reviewedAuthor name disambiguation in bibliographic databases is the problem of grouping togeth...
Author name disambiguation is a challenging problem in computer science. The problem arises from the...
We present a novel algorithm and validation method for disambiguating author names in very large bib...
available at the end of the article We present a novel algorithm and validation method for disambigu...
The desire for definitive data and the semantic web drive for inference over heterogeneous data sour...
The desire for definitive data and the semantic web drive for inference over heterogeneous data sour...
Name disambiguation can occur when one is seeking a list of publications of an author who has used d...
Name disambiguation can occur when one is seeking a list of publications of an author who has used d...
This work addresses the problem of author name homonymy in the Web of Science. Aiming for an efficie...
Properly identifying the author of a scientific article is an important task for giving credit, trac...
Recently, massive online academic resources have provided convenience for scientific study and resea...
The disambiguation of author names is an important and challenging task in bibliometrics. We propose...
The disambiguation of author names is an important and challenging task in bibliometrics. We propose...
As the number of authors is increasing exponentially over years, the number of authors sharing the s...
Abstract—Person name disambiguation is essential to dis-tinguish between persons that share the same...
peer reviewedAuthor name disambiguation in bibliographic databases is the problem of grouping togeth...
Author name disambiguation is a challenging problem in computer science. The problem arises from the...
We present a novel algorithm and validation method for disambiguating author names in very large bib...
available at the end of the article We present a novel algorithm and validation method for disambigu...
The desire for definitive data and the semantic web drive for inference over heterogeneous data sour...
The desire for definitive data and the semantic web drive for inference over heterogeneous data sour...
Name disambiguation can occur when one is seeking a list of publications of an author who has used d...
Name disambiguation can occur when one is seeking a list of publications of an author who has used d...
This work addresses the problem of author name homonymy in the Web of Science. Aiming for an efficie...
Properly identifying the author of a scientific article is an important task for giving credit, trac...
Recently, massive online academic resources have provided convenience for scientific study and resea...
The disambiguation of author names is an important and challenging task in bibliometrics. We propose...
The disambiguation of author names is an important and challenging task in bibliometrics. We propose...
As the number of authors is increasing exponentially over years, the number of authors sharing the s...
Abstract—Person name disambiguation is essential to dis-tinguish between persons that share the same...
peer reviewedAuthor name disambiguation in bibliographic databases is the problem of grouping togeth...
Author name disambiguation is a challenging problem in computer science. The problem arises from the...