The name disambiguation problem arises when one seeks a list of publications by an author who has used different name variations, or when multiple other authors share the same name. We present an efficient integrative framework for solving the name disambiguation problem: a blocking method retrieves candidate classes of authors with similar names, and a clustering method, DBSCAN, clusters papers by author. The distance metric between papers used in DBSCAN is computed by an online active selection support vector machine algorithm (LASVM), yielding a simpler model, lower test error, and faster prediction time than a standard SVM. We prove that by recasting transitivity as density reachability in DBSCAN, transitivity is guaranteed for co...
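The two stages described in the abstract above (blocking, then density-based clustering over pairwise paper distances) can be illustrated with a minimal sketch. This is not the authors' implementation: the blocking key (last name plus first initial), the function names `block_by_name` and `dbscan`, and the hand-written distance values are all assumptions made for illustration. In the framework described, the distances would come from the LASVM model rather than being typed in by hand.

```python
from collections import defaultdict


def block_by_name(authors):
    """Group record indices by an assumed blocking key: (last name, first initial)."""
    blocks = defaultdict(list)
    for idx, (first, last) in enumerate(authors):
        blocks[(last.lower(), first[:1].lower())].append(idx)
    return dict(blocks)


def dbscan(dist, eps, min_pts):
    """Minimal DBSCAN over a precomputed symmetric distance matrix.

    Returns one label per point; -1 marks noise. A point counts itself
    among its neighbors, as in the usual formulation.
    """
    n = len(dist)
    labels = [None] * n
    cluster = -1

    def neighbors(i):
        return [j for j in range(n) if dist[i][j] <= eps]

    for i in range(n):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = -1  # provisionally noise; may become a border point later
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # border point: joins the cluster, not expanded
            if labels[j] is not None:
                continue
            labels[j] = cluster
            reachable = neighbors(j)
            if len(reachable) >= min_pts:  # core point: keep expanding
                queue.extend(reachable)
    return labels


# Hypothetical author strings for one ambiguous name.
authors = [("J.", "Smith"), ("John", "Smith"), ("Jane", "Smyth")]
blocks = block_by_name(authors)

# Hypothetical pairwise distances between five papers in one block.
# Papers 0 and 2 are not directly close (0.5), but both are close to paper 1.
dist = [
    [0.0, 0.1, 0.5, 0.9, 0.9],
    [0.1, 0.0, 0.1, 0.9, 0.9],
    [0.5, 0.1, 0.0, 0.9, 0.9],
    [0.9, 0.9, 0.9, 0.0, 0.1],
    [0.9, 0.9, 0.9, 0.1, 0.0],
]
labels = dbscan(dist, eps=0.2, min_pts=2)
```

In this toy input, papers 0 and 2 end up in the same cluster even though their direct distance exceeds `eps`, because each is within `eps` of paper 1. That chaining is what density reachability provides, and it is one way to read the abstract's remark about recasting transitivity.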
The disambiguation of author names is an important and challenging task in bibliometrics. We propose...
Author disambiguation is the problem of determining whether records in a publications database that ...
Author disambiguation is the problem of determining whether records in a publications database refer...
Person name disambiguation is essential to distinguish between persons that share the same...
We present a novel algorithm and validation method for disambiguating author names in very large bib...
Name disambiguation in databases is a non-trivial task because people's names are often not unique a...
This work addresses the problem of author name homonymy in the Web of Science. Aiming for an efficie...
In this paper, we propose a clustering method for disambiguating abbreviated author names appearing ...
Author name disambiguation is a challenging problem in computer science. The problem arises from the...
Author name disambiguation in bibliographic databases is the problem of grouping togeth...
In the academic world, the number of scientists grows every year and so does the number of...
In this paper, we propose a heuristic-based hierarchical clustering (HHC) method to deal w...
In digital libraries, ambiguous author names occur due to the existence of multiple authors with the...
Entity disambiguation is an important step in many information retrieval applications. This paper pr...