This paper presents an unsupervised discretization method that performs density estimation for univariate data. The subintervals that the discretization produces can be used as the bins of a histogram. Histograms are a very simple and broadly understood means for displaying data, and our method automatically adapts bin widths to the data. It uses the log-likelihood as the scoring function to select cut points and the cross-validated log-likelihood to select the number of intervals. We compare this method with equal-width discretization where we also select the number of bins using the cross-validated log-likelihood and with equal-frequency discretization
Let p be an unknown and arbitrary probability distribution over [0, 1). We con-sider the problem of ...
We propose and implement a density estimation procedure which begins by turning density estimation i...
Let p be an unknown and arbitrary probability distribution over [0, 1). We con-sider the problem of ...
This paper presents an unsupervised discretization method that performs density estimation for univa...
Discretization, defined as a set of cuts over domains of attributes, represents an important pre-pro...
Data Mining can be seen as an extension to statistics. It comprises the preparation of data and the ...
Histograms are convenient non-parametric density estimators, which continue to be used ubiquitously....
We contribute to the study of data binning in density estimation. The particular disadvantage of his...
We consider estimation of multivariate densities with histograms which are based on data-dependent p...
We present a data-adaptive multivariate histogram estimator of an unknown density f based on n indep...
We present several multivariate histogram density estimates that are universally L1-optimal to withi...
We suggest a method for rendering a standard kernel density estimator unimodal: tilting the empirica...
We propose a fully automatic procedure for the construction of irregular histograms. For a given num...
Commonly, the data used in the real-world applications is composed by two types, the continuous data...
A natural way to estimate the probability density function of an unknown distribution from the sampl...
Let p be an unknown and arbitrary probability distribution over [0, 1). We con-sider the problem of ...
We propose and implement a density estimation procedure which begins by turning density estimation i...
Let p be an unknown and arbitrary probability distribution over [0, 1). We con-sider the problem of ...
This paper presents an unsupervised discretization method that performs density estimation for univa...
Discretization, defined as a set of cuts over domains of attributes, represents an important pre-pro...
Data Mining can be seen as an extension to statistics. It comprises the preparation of data and the ...
Histograms are convenient non-parametric density estimators, which continue to be used ubiquitously....
We contribute to the study of data binning in density estimation. The particular disadvantage of his...
We consider estimation of multivariate densities with histograms which are based on data-dependent p...
We present a data-adaptive multivariate histogram estimator of an unknown density f based on n indep...
We present several multivariate histogram density estimates that are universally L1-optimal to withi...
We suggest a method for rendering a standard kernel density estimator unimodal: tilting the empirica...
We propose a fully automatic procedure for the construction of irregular histograms. For a given num...
Commonly, the data used in the real-world applications is composed by two types, the continuous data...
A natural way to estimate the probability density function of an unknown distribution from the sampl...
Let p be an unknown and arbitrary probability distribution over [0, 1). We con-sider the problem of ...
We propose and implement a density estimation procedure which begins by turning density estimation i...
Let p be an unknown and arbitrary probability distribution over [0, 1). We con-sider the problem of ...