This paper addresses the problem of identifying topics which describe information content, in restricted size sets of scientific papers extracted from publication databases. Conventional computational approaches, based on natural language processing using unsupervised classification algorithms, typically require large numbers of papers to achieve adequate training. The approach presented here uses a simpler word-frequency-based approach coupled with context modeling. An example is provided of its application to corpora resulting from a curated literature search site for COVID-19 research publications. The results are compared with a conventional human-based approach, indicating partial overlap in the topics identified. The findings suggest...
Web issue analysis, a new automated technique designed to rapidly give timely management intelligenc...
In this work, we study the problem of characterizing an unlabelled corpus of biomedical documents in...
Copyright © 2014 ISSR Journals. This is an open access article distributed under the Creative Common...
This paper addresses the problem of identifying topics which describe information content, in restri...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...
It is estimated that the world’s data will increase to roughly 160 billion terabytes by 2025, with m...
Researchers that investigate the media’s coverage of health have historically relied on keyword sear...
Identifying the topic of an article can involve a lot of manual work. The manual processes can be ex...
Patients often search for information on the web about treatments and diseases after they are discha...
Topic modeling algorithms, such as LDA, find topics, hidden structures, in document corpora in an un...
∗Signatures are on file in the Graduate School. Discovery of latent semantic groupings and identific...
The number of digital medical documents is increasing continuously; several medical websites share a...
In publication driven domains such as the scientic community the availability of topic information i...
Large organizations often face the critical challenge of sharing information and maintaining connect...
Decker R, Scholz S. Unsupervised Topic Detection in document collections: an application in marketin...
Web issue analysis, a new automated technique designed to rapidly give timely management intelligenc...
In this work, we study the problem of characterizing an unlabelled corpus of biomedical documents in...
Copyright © 2014 ISSR Journals. This is an open access article distributed under the Creative Common...
This paper addresses the problem of identifying topics which describe information content, in restri...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...
It is estimated that the world’s data will increase to roughly 160 billion terabytes by 2025, with m...
Researchers that investigate the media’s coverage of health have historically relied on keyword sear...
Identifying the topic of an article can involve a lot of manual work. The manual processes can be ex...
Patients often search for information on the web about treatments and diseases after they are discha...
Topic modeling algorithms, such as LDA, find topics, hidden structures, in document corpora in an un...
∗Signatures are on file in the Graduate School. Discovery of latent semantic groupings and identific...
The number of digital medical documents is increasing continuously; several medical websites share a...
In publication driven domains such as the scientic community the availability of topic information i...
Large organizations often face the critical challenge of sharing information and maintaining connect...
Decker R, Scholz S. Unsupervised Topic Detection in document collections: an application in marketin...
Web issue analysis, a new automated technique designed to rapidly give timely management intelligenc...
In this work, we study the problem of characterizing an unlabelled corpus of biomedical documents in...
Copyright © 2014 ISSR Journals. This is an open access article distributed under the Creative Common...