peer-reviewedTopical annotation of documents with keyphrases is a proven method for revealing the subject of scientific and research documents to both human readers and information retrieval systems. This article describes a machine learning-based keyphrase annotation method for scientific documents that utilizes Wikipedia as a thesaurus for candidate selection from documents' content. We have devised a set of 20 statistical, positional and semantical features for candidate phrases to capture and reflect various properties of those candidates that have the highest keyphraseness probability. We first introduce a simple unsupervised method for ranking and filtering the most probable keyphrases, and then evolve it into a novel supervised metho...
This paper proposes a supervised model for keyphrase extraction from research papers, which are embe...
This paper examines the differences between author-generated keywords and automatically generated ke...
We propose a large dataset for machine learning-based automatic keyphrase extraction. The dataset ha...
peer-reviewedTopical indexing of documents with keyphrases is a common method used for revealing the...
Doctor of PhilosophyDepartment of Computer ScienceCornelia CarageaDoina CarageaScholarly digital lib...
This paper connects two research areas: automatic tagging on the web and statistical keyphrase extra...
This paper proposes PositionRank, an unsupervised model for keyphrase extraction from scholarly docu...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
This research addresses the problem of automatic keyphrase extraction from large documents and back ...
Keyphrases are an important means of document summarization, clustering, and topic search. Only a sm...
A journal article is often accompanied by a list of keyphrases, composed of about five to fifteen im...
Increasing number of documents in the Web caused the growth of needs for tools supporting automatic ...
Abstract. Many academic journals and conferences require that each article will in-clude a list of k...
The keyphrases of a document are the textual units that characterize its content such as the topics ...
Many academic journals ask their authors to provide a list of about five to fifteen key words, to ap...
This paper proposes a supervised model for keyphrase extraction from research papers, which are embe...
This paper examines the differences between author-generated keywords and automatically generated ke...
We propose a large dataset for machine learning-based automatic keyphrase extraction. The dataset ha...
peer-reviewedTopical indexing of documents with keyphrases is a common method used for revealing the...
Doctor of PhilosophyDepartment of Computer ScienceCornelia CarageaDoina CarageaScholarly digital lib...
This paper connects two research areas: automatic tagging on the web and statistical keyphrase extra...
This paper proposes PositionRank, an unsupervised model for keyphrase extraction from scholarly docu...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
This research addresses the problem of automatic keyphrase extraction from large documents and back ...
Keyphrases are an important means of document summarization, clustering, and topic search. Only a sm...
A journal article is often accompanied by a list of keyphrases, composed of about five to fifteen im...
Increasing number of documents in the Web caused the growth of needs for tools supporting automatic ...
Abstract. Many academic journals and conferences require that each article will in-clude a list of k...
The keyphrases of a document are the textual units that characterize its content such as the topics ...
Many academic journals ask their authors to provide a list of about five to fifteen key words, to ap...
This paper proposes a supervised model for keyphrase extraction from research papers, which are embe...
This paper examines the differences between author-generated keywords and automatically generated ke...
We propose a large dataset for machine learning-based automatic keyphrase extraction. The dataset ha...