The joint density of a data stream is suitable for performing data mining tasks without having access to the original data. However, the methods proposed so far only target a small to medium number of variables, since their estimates rely on representing all the interdependencies between the variables of the data. High-dimensional data streams, which are becoming more and more frequent due to increasing numbers of interconnected devices, are, therefore, pushing these methods to their limits. To mitigate these limitations, we present an approach that projects the original data stream into a vector space and uses a set of representatives to provide an estimate. Due to the structure of the estimates, it enables the density estimation of higher...
Data Mining can be seen as an extension to statistics. It comprises the preparation of data and the ...
A variety of real-world applications heavily relies on the analysis of transient data streams. Due t...
Clustering of data streams has become a task of great interest in the recent years as such data form...
This paper presents an algorithm for density estimation over non-stationary high-dimensional data st...
Efficient density estimation over an open-ended stream of high-dimensional data is of primary import...
A growing number of real-world applications share the property that they have to deal with transient...
Data mining and machine learning algorithms usually operate directly on the data. However, if the da...
A key ingredient to modern data analysis is probability density estimation. However, it is well know...
A key ingredient to modern data analysis is probability density estimation. However, it is well know...
We address the problem of estimating discrete, continuous, and conditional joint densities online, i...
We address the problem of estimating a discrete joint density online, that is, the algorithm is only...
The ratio of two probability density functions is becoming a quantity of interest these days in the ...
Abstract—We address the problem of estimating a discrete joint density online, that is, the algorith...
Traditional pattern mining algorithms require access to the data, either in the form of a complete s...
Density estimation has wide applications in machine learning and data analysis techniques including ...
Data Mining can be seen as an extension to statistics. It comprises the preparation of data and the ...
A variety of real-world applications heavily relies on the analysis of transient data streams. Due t...
Clustering of data streams has become a task of great interest in the recent years as such data form...
This paper presents an algorithm for density estimation over non-stationary high-dimensional data st...
Efficient density estimation over an open-ended stream of high-dimensional data is of primary import...
A growing number of real-world applications share the property that they have to deal with transient...
Data mining and machine learning algorithms usually operate directly on the data. However, if the da...
A key ingredient to modern data analysis is probability density estimation. However, it is well know...
A key ingredient to modern data analysis is probability density estimation. However, it is well know...
We address the problem of estimating discrete, continuous, and conditional joint densities online, i...
We address the problem of estimating a discrete joint density online, that is, the algorithm is only...
The ratio of two probability density functions is becoming a quantity of interest these days in the ...
Abstract—We address the problem of estimating a discrete joint density online, that is, the algorith...
Traditional pattern mining algorithms require access to the data, either in the form of a complete s...
Density estimation has wide applications in machine learning and data analysis techniques including ...
Data Mining can be seen as an extension to statistics. It comprises the preparation of data and the ...
A variety of real-world applications heavily relies on the analysis of transient data streams. Due t...
Clustering of data streams has become a task of great interest in the recent years as such data form...