The Gumbel-Softmax is a continuous distribution over the simplex that is often used as a relaxation of discrete distributions. Because it can be readily interpreted and easily reparameterized, it enjoys widespread use. We propose a modular and more flexible family of reparameterizable distributions where Gaussian noise is transformed into a one-hot approximation through an invertible function. This invertible function is composed of a modified softmax and can incorporate diverse transformations that serve different specific purposes. For example, the stick-breaking procedure allows us to extend the reparameterization trick to distributions with countably infinite support, thus enabling the use of our distribution along nonparametric models,...
We study the distribution of hard-, soft-, and adaptive soft-thresholding estimators within a linear...
We illustrate the detrimental effect, such as overconfident decisions, that exponential behavior can...
In the paper, we proposed a generakization of the gumbel distribution. Simpel properties of the dist...
The Gumbel-max trick is a method to draw a sample from a categorical distribution, given by its unno...
The Gumbel trick is a method to sample from a discrete probability distribution, or to estimate its ...
The Gumbel trick is a method to sample from a discrete probability distribution, or to estimate its ...
The reparameterization gradient has become a widely used method to obtain Monte Carlo gradients to o...
Variational autoencoders (VAE) have recently become one of the most interesting developments in deep...
Enabling machine learning classifiers to defer their decision to a downstream expert when the expert...
Gaussian smoothing (GS) is a derivative-free optimization (DFO) algorithm that estimates the gradien...
Is Stochastic Gradient Descent (SGD) substantially different from Glauber dynamics? This is a fundam...
Recent advances in diffusion models bring state-of-the-art performance on image generation tasks. Ho...
Post-training quantization (PTQ) is the go-to compression technique for large generative models, suc...
In this article, additional properties of the Gumbel-Burr XII distribution, denoted by (GBXII(L)), d...
We establish a connection between stochastic optimal control and generative models based on stochast...
We study the distribution of hard-, soft-, and adaptive soft-thresholding estimators within a linear...
We illustrate the detrimental effect, such as overconfident decisions, that exponential behavior can...
In the paper, we proposed a generakization of the gumbel distribution. Simpel properties of the dist...
The Gumbel-max trick is a method to draw a sample from a categorical distribution, given by its unno...
The Gumbel trick is a method to sample from a discrete probability distribution, or to estimate its ...
The Gumbel trick is a method to sample from a discrete probability distribution, or to estimate its ...
The reparameterization gradient has become a widely used method to obtain Monte Carlo gradients to o...
Variational autoencoders (VAE) have recently become one of the most interesting developments in deep...
Enabling machine learning classifiers to defer their decision to a downstream expert when the expert...
Gaussian smoothing (GS) is a derivative-free optimization (DFO) algorithm that estimates the gradien...
Is Stochastic Gradient Descent (SGD) substantially different from Glauber dynamics? This is a fundam...
Recent advances in diffusion models bring state-of-the-art performance on image generation tasks. Ho...
Post-training quantization (PTQ) is the go-to compression technique for large generative models, suc...
In this article, additional properties of the Gumbel-Burr XII distribution, denoted by (GBXII(L)), d...
We establish a connection between stochastic optimal control and generative models based on stochast...
We study the distribution of hard-, soft-, and adaptive soft-thresholding estimators within a linear...
We illustrate the detrimental effect, such as overconfident decisions, that exponential behavior can...
In the paper, we proposed a generakization of the gumbel distribution. Simpel properties of the dist...