This thesis studies distance approximation in two closely related models- the streaming model and the two-party communication model. In the streaming model, a massive data stream is presented in an arbitrary order to a randomized algorithm that tries to approximate certain statistics of tile data with only a few (usually one) passes over the data. For instance, the data may be a flow of packets on the internet or a set of records in a large database. The size of the data necessitates the use of extremely efficient randomized approximation algorithms. Problems of interest include approximating the number of distinct elements, approximating the surprise index of a stream, or more generally, approximating the norm of a dynamically-changing vec...
A data stream is a transiently observed sequence of data elements that arrive unordered, with repeti...
We develop the first streaming algorithm and the first two-party communication protocol that uses a ...
We study the communication complexity of evaluating functions when the input data is randomly alloca...
We study the communication complexity of evaluating functions when the input data is randomly alloca...
In this PhD thesis, we consider two computational models that address problems that arise when proce...
In this PhD thesis, we consider two computational models that address problems that arise when proce...
In this PhD thesis, we consider two computational models that address problems that arise when proce...
Exact solutions are unattainable for important problems. The calculations are limited by the memory ...
We consider the problem of computing a (1+epsilon)-approximation of the Hamming distance between a p...
International audienceWe consider the problem of computing distance between a pattern of length n an...
In this paper we consider problems related to the sortedness of a data stream. First we investigate ...
Recall from last lecture the simple but useful model of one-way communication complexity. Alice has ...
Abstract. We develop the first streaming algorithm and the first two-party com-munication protocol t...
We consider the problem of computing distance between a pattern of length n and all n-length subword...
We define the Streaming Communication model that combines the main aspects of communication complexi...
A data stream is a transiently observed sequence of data elements that arrive unordered, with repeti...
We develop the first streaming algorithm and the first two-party communication protocol that uses a ...
We study the communication complexity of evaluating functions when the input data is randomly alloca...
We study the communication complexity of evaluating functions when the input data is randomly alloca...
In this PhD thesis, we consider two computational models that address problems that arise when proce...
In this PhD thesis, we consider two computational models that address problems that arise when proce...
In this PhD thesis, we consider two computational models that address problems that arise when proce...
Exact solutions are unattainable for important problems. The calculations are limited by the memory ...
We consider the problem of computing a (1+epsilon)-approximation of the Hamming distance between a p...
International audienceWe consider the problem of computing distance between a pattern of length n an...
In this paper we consider problems related to the sortedness of a data stream. First we investigate ...
Recall from last lecture the simple but useful model of one-way communication complexity. Alice has ...
Abstract. We develop the first streaming algorithm and the first two-party com-munication protocol t...
We consider the problem of computing distance between a pattern of length n and all n-length subword...
We define the Streaming Communication model that combines the main aspects of communication complexi...
A data stream is a transiently observed sequence of data elements that arrive unordered, with repeti...
We develop the first streaming algorithm and the first two-party communication protocol that uses a ...
We study the communication complexity of evaluating functions when the input data is randomly alloca...