Unsupervised discretization is a crucial step in many knowledge discovery tasks. The state-of-the-art method for one-dimensional data infers locally adaptive histograms using the minimum description length (MDL) principle, but the multi-dimensional case is far less studied: current methods consider the dimensions one at a time (if not independently), which result in discretizations based on rectangular cells of adaptive size. Unfortunately, this approach is unable to adequately characterize dependencies among dimensions and/or results in discretizations consisting of more cells (or bins) than is desirable. To address this problem, we propose an expressive model class that allows for far more flexible partitions of two-dimensional data. We...
We consider the problem of model selection using the Minimum Description Length (MDL) criterion for ...
Machine-Learned Likelihoods (MLL) is a method that, by combining modern machine-learning classificat...
Approximation of the optimal two-part minimum description length (MDL) code for given data, through ...
G-Enum histograms are a new fast and fully automated method for irregular histogram construction. By...
We regard histogram density estimation as a model selection problem. Our approach is based on the in...
The Minimum Description Length (MDL) principle is a general, well-founded theoretical formalization ...
We regard histogram density estimation as a model selection problem. Our approach is based on the ...
When considering a data set it is often unknown how complex it is, and hence it is difficult to asse...
Abstract(#br)We present a data-adaptive multivariate histogram estimator of an unknown density f bas...
We present tree- and list- structured density estimation methods for high dimensional binary/categor...
We present general sufficient conditions for the almost sure $L_1$-consistency of histogram density ...
This is an up-to-date introduction to and overview of the Minimum Description Length (MDL) Principle...
This is an up-to-date introduction to and overview of the Minimum Description Length (MDL) Principle...
Unsupervised representation learning aims at describing raw data efficiently to solve various downst...
Model selection plays an important part in machine learning and in artificial intelligence in genera...
We consider the problem of model selection using the Minimum Description Length (MDL) criterion for ...
Machine-Learned Likelihoods (MLL) is a method that, by combining modern machine-learning classificat...
Approximation of the optimal two-part minimum description length (MDL) code for given data, through ...
G-Enum histograms are a new fast and fully automated method for irregular histogram construction. By...
We regard histogram density estimation as a model selection problem. Our approach is based on the in...
The Minimum Description Length (MDL) principle is a general, well-founded theoretical formalization ...
We regard histogram density estimation as a model selection problem. Our approach is based on the ...
When considering a data set it is often unknown how complex it is, and hence it is difficult to asse...
Abstract(#br)We present a data-adaptive multivariate histogram estimator of an unknown density f bas...
We present tree- and list- structured density estimation methods for high dimensional binary/categor...
We present general sufficient conditions for the almost sure $L_1$-consistency of histogram density ...
This is an up-to-date introduction to and overview of the Minimum Description Length (MDL) Principle...
This is an up-to-date introduction to and overview of the Minimum Description Length (MDL) Principle...
Unsupervised representation learning aims at describing raw data efficiently to solve various downst...
Model selection plays an important part in machine learning and in artificial intelligence in genera...
We consider the problem of model selection using the Minimum Description Length (MDL) criterion for ...
Machine-Learned Likelihoods (MLL) is a method that, by combining modern machine-learning classificat...
Approximation of the optimal two-part minimum description length (MDL) code for given data, through ...