ADtrees, a data structure useful for caching sufficient statistics, have been successfully adapted to grow lazily when memory is limited and to update sequentially with an incrementally updated dataset. For low arity sym-bolic features, ADtrees trade a slight increase in query time for a reduction in overall tree size. Unfortunately, for high arity features, the same technique can often re-sult in a very large increase in query time and a nearly negligible tree size reduction. In the dynamic (lazy) version of the tree, both query time and tree size can increase for some applications. Here we present two modifications to the ADtree which can be used sepa-rately or in combination to achieve the originally in-tended space-time tradeoff in the ...
Abstract. We introduce a new O(lg lg n)-competitive binary search tree data structure called poketre...
In some applications, data capture dominates query processing. For example, monitoring moving object...
In some applications, data capture dominates query processing. For example, monitoring moving object...
ADtrees, a data structure useful for caching sufficient statistics, have been successfully adapted t...
Ingcreasingly, data-mining algorithms must deal with databases that continuously grow over time. The...
The problem of discovering association rules in large data-bases has received considerable research ...
This paper has no novel learning or statistics: it is concerned with making a wide class of preexis...
Abstract—We consider the problem of efficiently storing n-gram counts for large n over very large co...
This paper introduces new algorithms and data structures for quick counting for machine learning dat...
This paper introduces new algorithms and data structures for quick counting for machine learning dat...
The explosion of big data poses a serious problem to the efficient retrieval and management of infor...
We propose two new improvements for bagging methods on evolving data streams. Recently, two new vari...
Large datasets, on the order of GB and TB, are increasingly common as abundant computational resourc...
The emerging class of instance-optimized systems has shown potential to achieve high performance by ...
Metric Access Methods (MAM) are employed to accelerate the processing of similarity queries, such as...
Abstract. We introduce a new O(lg lg n)-competitive binary search tree data structure called poketre...
In some applications, data capture dominates query processing. For example, monitoring moving object...
In some applications, data capture dominates query processing. For example, monitoring moving object...
ADtrees, a data structure useful for caching sufficient statistics, have been successfully adapted t...
Ingcreasingly, data-mining algorithms must deal with databases that continuously grow over time. The...
The problem of discovering association rules in large data-bases has received considerable research ...
This paper has no novel learning or statistics: it is concerned with making a wide class of preexis...
Abstract—We consider the problem of efficiently storing n-gram counts for large n over very large co...
This paper introduces new algorithms and data structures for quick counting for machine learning dat...
This paper introduces new algorithms and data structures for quick counting for machine learning dat...
The explosion of big data poses a serious problem to the efficient retrieval and management of infor...
We propose two new improvements for bagging methods on evolving data streams. Recently, two new vari...
Large datasets, on the order of GB and TB, are increasingly common as abundant computational resourc...
The emerging class of instance-optimized systems has shown potential to achieve high performance by ...
Metric Access Methods (MAM) are employed to accelerate the processing of similarity queries, such as...
Abstract. We introduce a new O(lg lg n)-competitive binary search tree data structure called poketre...
In some applications, data capture dominates query processing. For example, monitoring moving object...
In some applications, data capture dominates query processing. For example, monitoring moving object...