Presented on September 16, 2019 at 11:00 a.m. in the Groseclose Building, Room 402.Jelani Nelson is an EECS faculty member at UC Berkeley, where he is a member of the Theory Group.Runtime: 60:15 minutesIn the 'frequent items' problem one sees a sequence of items in a stream (e.g. a stream of words coming into a search query engine like Google) and wants to report a small list of items containing all frequent items. In the 'change detection' problem one sees two streams, say one from yesterday and one from today, and wants to report a small list of items containing all those whose frequencies changed significantly. For both of these problems, we would like algorithms that use memory substantially sublinear in the length of the stream. We d...
We study the problem of finding the k most frequent items in a stream of items for the recently prop...
In this paper we present PFDCMSS (Parallel Forward Decay Count-Min Space Saving) which, to the best ...
We study the classic problem of finding l_1 heavy hitters in the streaming model. In the general tur...
An old and fundamental problem in databases and data streams is that of finding the heavy hitters, a...
The task of finding heavy hitters is one of the best known and well studied problems in the area of ...
Streaming model supplies solutions for handling enormous data flows for over 20 years now. The mode...
A core mining problem is to find items that occur more than one would expect. These may be called ou...
A stream can be thought of as a very large set of data, sometimes even infinite, which arrives seque...
The Hierarchical Heavy Hitters problem extends the notion of frequent items to data ar-ranged in a h...
Data Mining is one of the central activities associated with understanding and exploiting the world...
The problem of mining Correlated Heavy Hitters (CHH) from a two- dimensional data stream has been in...
The frequent items problem is to process a stream of items and find all items occurring more than a ...
Sometimes data is generated unboundedly and at such a fast pace that it is no longer possible to sto...
Mining frequent itemsets from transactional data streams is challenging due to the nature of the exp...
We study the problem of identifying items with heavy weights in the sliding window of a weighted dat...
We study the problem of finding the k most frequent items in a stream of items for the recently prop...
In this paper we present PFDCMSS (Parallel Forward Decay Count-Min Space Saving) which, to the best ...
We study the classic problem of finding l_1 heavy hitters in the streaming model. In the general tur...
An old and fundamental problem in databases and data streams is that of finding the heavy hitters, a...
The task of finding heavy hitters is one of the best known and well studied problems in the area of ...
Streaming model supplies solutions for handling enormous data flows for over 20 years now. The mode...
A core mining problem is to find items that occur more than one would expect. These may be called ou...
A stream can be thought of as a very large set of data, sometimes even infinite, which arrives seque...
The Hierarchical Heavy Hitters problem extends the notion of frequent items to data ar-ranged in a h...
Data Mining is one of the central activities associated with understanding and exploiting the world...
The problem of mining Correlated Heavy Hitters (CHH) from a two- dimensional data stream has been in...
The frequent items problem is to process a stream of items and find all items occurring more than a ...
Sometimes data is generated unboundedly and at such a fast pace that it is no longer possible to sto...
Mining frequent itemsets from transactional data streams is challenging due to the nature of the exp...
We study the problem of identifying items with heavy weights in the sliding window of a weighted dat...
We study the problem of finding the k most frequent items in a stream of items for the recently prop...
In this paper we present PFDCMSS (Parallel Forward Decay Count-Min Space Saving) which, to the best ...
We study the classic problem of finding l_1 heavy hitters in the streaming model. In the general tur...