How can we maintain a dynamic profile capturing a user’s reading interest against the common interest? What are the queries that have been asked 1,000 times more frequently to a search engine from users in Asia than in Europe? To answer such interesting questions, we need to find discriminative items in multiple data streams. In this thesis, we show that, to exactly find all discriminative items in stream S1 against stream S2 by one scan, the space lower bound is O(|S| log (n1/|S|)), where S is the alphabet of items and n1 is the current size of S1. To tackle the space challenge, we develop three heuristic algorithms that can achieve high precision and recall using sub-linear space and processing time per item with respect to |S|. The compl...
Many critical applications, like intrusion detection or stock market analysis, require a nearly imme...
Traditional algorithms for frequent itemset discovery are designed for static data. They cannot be s...
One important challenge in data mining is the ability to deal with complex, voluminous and dynamic d...
Big data availability in areas such as social networks, online marketing systems and stock markets i...
This paper presents a single pass algorithm for mining discriminative Itemsets in data streams using...
In this paper, we present an efficient novel method for mining discriminative itemsets over data str...
Mining frequent itemsets in a datastream proves to be a difficult problem, as itemsets arrive in rap...
Discriminative itemsets can be more useful than frequent itemsets as the former identifies the frequ...
While there has been a lot of work on finding frequent itemsets in transaction data streams, none of...
We tackle the problem of discriminative itemset mining. Given a set of datasets, we want to find the...
The challenge of monitoring massive amounts of data gen-erated by communication networks has led to ...
The increasing prominence of data streams arising in a wide range of advanced applications such as f...
The increasing prominence of data streams arising in a wide range of advanced applications such as f...
Data on the Web is noisy, huge, and dynamic. This poses enormous challenges to most data mining tech...
Abstract Mining frequent itemsets in a datastream proves to be a difficult problem, as itemsets arri...
Many critical applications, like intrusion detection or stock market analysis, require a nearly imme...
Traditional algorithms for frequent itemset discovery are designed for static data. They cannot be s...
One important challenge in data mining is the ability to deal with complex, voluminous and dynamic d...
Big data availability in areas such as social networks, online marketing systems and stock markets i...
This paper presents a single pass algorithm for mining discriminative Itemsets in data streams using...
In this paper, we present an efficient novel method for mining discriminative itemsets over data str...
Mining frequent itemsets in a datastream proves to be a difficult problem, as itemsets arrive in rap...
Discriminative itemsets can be more useful than frequent itemsets as the former identifies the frequ...
While there has been a lot of work on finding frequent itemsets in transaction data streams, none of...
We tackle the problem of discriminative itemset mining. Given a set of datasets, we want to find the...
The challenge of monitoring massive amounts of data gen-erated by communication networks has led to ...
The increasing prominence of data streams arising in a wide range of advanced applications such as f...
The increasing prominence of data streams arising in a wide range of advanced applications such as f...
Data on the Web is noisy, huge, and dynamic. This poses enormous challenges to most data mining tech...
Abstract Mining frequent itemsets in a datastream proves to be a difficult problem, as itemsets arri...
Many critical applications, like intrusion detection or stock market analysis, require a nearly imme...
Traditional algorithms for frequent itemset discovery are designed for static data. They cannot be s...
One important challenge in data mining is the ability to deal with complex, voluminous and dynamic d...