We study the problems of decomposing and sharing user-defined aggregate functions in distributed and parallel computing. Aggre-gation usually needs to satisfy the distributive property to compute in parallel, and to leverage optimization in multidimensional data analysis and conjunctive query with aggregation. However, this property is too restricted to allow more aggregation to benefit from these advantages. We propose for user-defined aggregation functions a formal framework to relax the previous condition, and we map this framework to the MRC, an efficient computation model in MapReduce, to automatically generate efficient partial aggrega-tion functions. Moreover, we identify the complete conditions for sharing the result of practical us...
We consider the problem of resource sharing when processing large numbers of continuous queries. We ...
Distributed data aggregation is an important task, allowing the de-centralized determination of mean...
We consider the problem of handling aggregate computations in a scalable fashion in stream databases...
Applications of aggregation for information summary have great meanings in various fields. In big da...
Partial aggregation is of great importance in many dis-tributed data-parallel systems. Most notably,...
Summarization: An emerging challenge in modern distributed querying is to effi- ciently process mult...
Applications of aggregations for information summary have great meanings in variousfields. System bu...
The emergence of the Internet as a computing platform increases the demand for new classes of algori...
Computing multiple related group-bys and aggregates is one of the core operations of On-Line Analyti...
We examine the problem of eciently computing sum/count/avg aggregates over objects with nonzero ext...
The integration of computers into many facets of our lives has made the collection and storage of st...
Abstract. We extend the Constraint Handling Rules language with ag-gregates such as sum, count, find...
Efficient evaluation of aggregate functions in object-oriented databases (OODB) can have considerabl...
Edited by R. Mesiar and S. MontesInternational audienceThis position paper discusses the role of the...
Abstract. MapReduce, being inspired by the map and reduce primi-tives available in many functional l...
We consider the problem of resource sharing when processing large numbers of continuous queries. We ...
Distributed data aggregation is an important task, allowing the de-centralized determination of mean...
We consider the problem of handling aggregate computations in a scalable fashion in stream databases...
Applications of aggregation for information summary have great meanings in various fields. In big da...
Partial aggregation is of great importance in many dis-tributed data-parallel systems. Most notably,...
Summarization: An emerging challenge in modern distributed querying is to effi- ciently process mult...
Applications of aggregations for information summary have great meanings in variousfields. System bu...
The emergence of the Internet as a computing platform increases the demand for new classes of algori...
Computing multiple related group-bys and aggregates is one of the core operations of On-Line Analyti...
We examine the problem of eciently computing sum/count/avg aggregates over objects with nonzero ext...
The integration of computers into many facets of our lives has made the collection and storage of st...
Abstract. We extend the Constraint Handling Rules language with ag-gregates such as sum, count, find...
Efficient evaluation of aggregate functions in object-oriented databases (OODB) can have considerabl...
Edited by R. Mesiar and S. MontesInternational audienceThis position paper discusses the role of the...
Abstract. MapReduce, being inspired by the map and reduce primi-tives available in many functional l...
We consider the problem of resource sharing when processing large numbers of continuous queries. We ...
Distributed data aggregation is an important task, allowing the de-centralized determination of mean...
We consider the problem of handling aggregate computations in a scalable fashion in stream databases...