Statistics computation over data streams is often required by many applications, including processing of relational type queries, data mining and high speed network management. Among various s tatistics, order statistics computation is one of the most challenging, and is employed in many real applications, such as web ranking aggregation and log mining, sensor data analysis, trends and fleeting opportunities detection in stock markets and load balanced data partitioning for distributed computation.In this thesis, we investige three important problems in computing order statistics over data streams:1. Computing rank queries over data streams with relative error guarantee.2. Computing rank queries over data streams with duplication.3. Computi...
Recently, due to the imprecise nature of the data generated from a variety of streaming applications...
Estimating the sortedness of a sequence has found applications in, e.g., sorting algorithms, databas...
Discovering a bucket order B from a collection of possibly noisy full rankings is a fundamental prob...
In this paper, we investigated the problem of approximately processing rank queries against distinct...
Duplicates in data streams may often be observed by the projection on a subspace and/or multiple rec...
A rank query is essentially to find a data element with a given rank against a monotonic order speci...
In modern applications, it is a big challenge that analyzing the order statistics about the most rec...
We study the problem of continuous monitoring of top-k queries over multiple non-synchronized stream...
High-volume data streams are too large and grow too quickly to store entirely in working memory, int...
When trying to process a data stream in small space, how important is the order in which the data ar...
We consider the problem of approximate sorting of a data stream (in one pass) with limited internal ...
In this paper, we propose and study the problem of top-m rank aggregation of spatial objects in stre...
\u3cp\u3eWhile traditional data management systems focus on evaluating single, ad hoc queries over s...
This thesis is concerned with the study of problems related to the measurement of disorder in the da...
We introduce a new class of algorithms to estimate the cardinality of very large multisets using con...
Recently, due to the imprecise nature of the data generated from a variety of streaming applications...
Estimating the sortedness of a sequence has found applications in, e.g., sorting algorithms, databas...
Discovering a bucket order B from a collection of possibly noisy full rankings is a fundamental prob...
In this paper, we investigated the problem of approximately processing rank queries against distinct...
Duplicates in data streams may often be observed by the projection on a subspace and/or multiple rec...
A rank query is essentially to find a data element with a given rank against a monotonic order speci...
In modern applications, it is a big challenge that analyzing the order statistics about the most rec...
We study the problem of continuous monitoring of top-k queries over multiple non-synchronized stream...
High-volume data streams are too large and grow too quickly to store entirely in working memory, int...
When trying to process a data stream in small space, how important is the order in which the data ar...
We consider the problem of approximate sorting of a data stream (in one pass) with limited internal ...
In this paper, we propose and study the problem of top-m rank aggregation of spatial objects in stre...
\u3cp\u3eWhile traditional data management systems focus on evaluating single, ad hoc queries over s...
This thesis is concerned with the study of problems related to the measurement of disorder in the da...
We introduce a new class of algorithms to estimate the cardinality of very large multisets using con...
Recently, due to the imprecise nature of the data generated from a variety of streaming applications...
Estimating the sortedness of a sequence has found applications in, e.g., sorting algorithms, databas...
Discovering a bucket order B from a collection of possibly noisy full rankings is a fundamental prob...