Knowledge discovery has attracted tremendous interest and developed rapidly in both text mining and social user mining. The main purpose is to search massive volumes of data for patterns, the so-called knowledge. Knowledge can exist in different formats, such as text or numbers. Knowledge can be observed or hidden in different hierarchies. Knowledge can even be user-generated, such as social content and social activity in the Web 2.0 era. In this dissertation, we study a series of new knowledge discovery techniques using four data mining applications. First, we propose a novel framework for mining text databases using time series by bridging two seemingly unrelated domains: alphabet strings and numerical signals. We study how various transformat...
We all witnessed the information explosion of the Wo...
Topic Modeling is a well-known unsupervised learning technique used when dealing with text data. It ...
The enormous amount of information stored in unstructured texts cannot simply be used for further pr...
Many document collections are by nature dynamic, evolving as the topics or events they describe chan...
Text mining, or information discovery, is the sub-area of information mining that is extensively be...
The popularity of the Internet has caused an increasing amount of data. Data are not only rich in amount...
It is estimated that the world’s data will increase to roughly 160 billion terabytes by 2025, with m...
As large-scale digital text collections become abundant, the necessity of automatically summarizing ...
With the dramatic growth of text information, there is an increasing need for powerful text mining s...
A massive amount of information is stored as text in the real world. Classifying the texts according...
With the dramatic growth of text information, there is an increasing need for powerful text mining s...
We present a system for mapping the structure of research topics in a corpus. ...
In mining technology, text mining plays a vital role in today’s life. Text mining is cluster data...
Many data mining techniques have been proposed for mining useful patterns in text documents. However...
We propose a generative model based on latent Dirichlet allocation for mining distinct topics in doc...