We introduce a simple and efficient method for clustering and identifying temporal trends in hyper-linked document databases. Our method can scale to large datasets because it exploits the underlying regularity often found in hyper-linked document databases. Because of this scalability, we can use our method to study the temporal trends of individual clusters in a statistically meaningful manner. As an example of our approach, we give a summary of the temporal trends found in a scientific literature database with thousands of documents.
The growing availability of large diachronic corpora of scientific literature offers the opportunity...
Chronological corpora are collections of texts ordered in time. Texts are often grouped into equal t...
Tracking the dynamics of science and early detection of the emerging research trends could potential...
International audienceWe address here two major challenges presented by dynamic data mining: 1) the ...
Clustering is an essential data mining task with numerous applications. Clustering is the process of...
In this work we propose a data-driven methodology for identifying temporal trends in a corpus of med...
The extraction of significant, relevant, and useful trends from massive document collections, such a...
Most classification methods are based on the assumption that data conforms to a stationary distribut...
Trend prediction has become an extremely popular practice in many industrial sectors and academia. I...
We address here two major challenges presented by dynamic data mining: 1) the stability challenge: w...
PACLIC / The University of the Philippines Visayas Cebu College Cebu City, Philippines / November 20...
International audienceThe textual content of a document and its publication date are intertwined. Fo...
This paper considers the problem of analyzing the development of a document collection over time wit...
Trend analysis and anomaly detection is gaining more and more interest since more than a decade. It ...
www.laboratories.telekom.com We propose a method for the detection of trends in social bookmarking s...
The growing availability of large diachronic corpora of scientific literature offers the opportunity...
Chronological corpora are collections of texts ordered in time. Texts are often grouped into equal t...
Tracking the dynamics of science and early detection of the emerging research trends could potential...
International audienceWe address here two major challenges presented by dynamic data mining: 1) the ...
Clustering is an essential data mining task with numerous applications. Clustering is the process of...
In this work we propose a data-driven methodology for identifying temporal trends in a corpus of med...
The extraction of significant, relevant, and useful trends from massive document collections, such a...
Most classification methods are based on the assumption that data conforms to a stationary distribut...
Trend prediction has become an extremely popular practice in many industrial sectors and academia. I...
We address here two major challenges presented by dynamic data mining: 1) the stability challenge: w...
PACLIC / The University of the Philippines Visayas Cebu College Cebu City, Philippines / November 20...
International audienceThe textual content of a document and its publication date are intertwined. Fo...
This paper considers the problem of analyzing the development of a document collection over time wit...
Trend analysis and anomaly detection is gaining more and more interest since more than a decade. It ...
www.laboratories.telekom.com We propose a method for the detection of trends in social bookmarking s...
The growing availability of large diachronic corpora of scientific literature offers the opportunity...
Chronological corpora are collections of texts ordered in time. Texts are often grouped into equal t...
Tracking the dynamics of science and early detection of the emerging research trends could potential...