We consider the problem of estimating Shannon's entropy H from discrete data, in cases where the number of possible symbols is unknown or even countably infinite. The Pitman-Yor process, a generalization of Dirichlet process, provides a tractable prior distribution over the space of countably infinite discrete distributions, and has found major applications in Bayesian non- parametric statistics and machine learning. Here we show that it provides a natural family of priors for Bayesian entropy estimation, due to the fact that moments of the induced posterior distribution over H can be computed analytically. We derive formulas for the posterior mean (Bayes' least squares estimate) and variance under Dirichlet and Pitman-Yor process priors. M...
The need to estimate smooth probability distributions (a.k.a. probability densities) from finite sam...
Given iid samples drawn from a distribution with known parametric form, we propose the minimization ...
When constructing discrete (binned) distributions from samples of a data set, applications exist whe...
We consider the problem of estimating Shannon’s entropy H from discrete data, in cases where the num...
We consider the problem of estimating Shannon’s entropy H from discrete data, in cases where the num...
We study properties of popular near–uniform (Dirichlet) priors for learning undersampled probability...
: This paper is the first of two on the problem of estimating a function of a probability distributi...
The use of entropy related concepts goes from physics, such as in statistical mechanics, to evolutio...
We present a new class of estimators of Shannon entropy for severely undersampled discrete distribut...
Random probability measures are a cornerstone of Bayesian nonparametrics. By virtue of de Finetti's ...
This PhD report deals with the estimation of both Shannon entropy of distributions from independent ...
Shannon entropy of a probability distribution gives a weighted mean of a measure of information that...
Discrete random probability measures and the exchangeable random partitions they induce are key tool...
Given an i.i.d. sample (X1, Xn) drawn from an unknown discrete distribution P on a countably infinit...
We consider Bayesian estimation of information-theoretic quantities from data, using a Dirichlet pr...
The need to estimate smooth probability distributions (a.k.a. probability densities) from finite sam...
Given iid samples drawn from a distribution with known parametric form, we propose the minimization ...
When constructing discrete (binned) distributions from samples of a data set, applications exist whe...
We consider the problem of estimating Shannon’s entropy H from discrete data, in cases where the num...
We consider the problem of estimating Shannon’s entropy H from discrete data, in cases where the num...
We study properties of popular near–uniform (Dirichlet) priors for learning undersampled probability...
: This paper is the first of two on the problem of estimating a function of a probability distributi...
The use of entropy related concepts goes from physics, such as in statistical mechanics, to evolutio...
We present a new class of estimators of Shannon entropy for severely undersampled discrete distribut...
Random probability measures are a cornerstone of Bayesian nonparametrics. By virtue of de Finetti's ...
This PhD report deals with the estimation of both Shannon entropy of distributions from independent ...
Shannon entropy of a probability distribution gives a weighted mean of a measure of information that...
Discrete random probability measures and the exchangeable random partitions they induce are key tool...
Given an i.i.d. sample (X1, Xn) drawn from an unknown discrete distribution P on a countably infinit...
We consider Bayesian estimation of information-theoretic quantities from data, using a Dirichlet pr...
The need to estimate smooth probability distributions (a.k.a. probability densities) from finite sam...
Given iid samples drawn from a distribution with known parametric form, we propose the minimization ...
When constructing discrete (binned) distributions from samples of a data set, applications exist whe...