The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables

Maddison, Chris J.; Mnih, Andriy; Teh, Yee Whye

arXiv:1611.00712 (cs)

[Submitted on 2 Nov 2016 (v1), last revised 5 Mar 2017 (this version, v3)]

Abstract: The reparameterization trick enables optimizing large-scale stochastic computation graphs via gradient descent. The essence of the trick is to refactor each stochastic node into a differentiable function of its parameters and a random variable with fixed distribution. After refactoring, the gradients of the loss propagated by the chain rule through the graph are low-variance, unbiased estimators of the gradients of the expected loss. While many continuous random variables have such reparameterizations, discrete random variables lack useful reparameterizations due to the discontinuous nature of discrete states. In this work we introduce Concrete random variables: continuous relaxations of discrete random variables. The Concrete distribution is a new family of distributions with closed-form densities and a simple reparameterization. Whenever a discrete stochastic node of a computation graph can be refactored into a one-hot bit representation that is treated continuously, Concrete stochastic nodes can be used with automatic differentiation to produce low-variance biased gradients of objectives (including objectives that depend on the log-probability of latent stochastic nodes) on the corresponding discrete graph. We demonstrate the effectiveness of Concrete relaxations on density estimation and structured prediction tasks using neural networks.
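
The abstract does not spell out the sampler, so the following is only a minimal sketch of the refactoring it describes: a relaxed one-hot sample written as a differentiable function of its parameters and fixed-distribution noise. It assumes the commonly used form in which Gumbel(0, 1) noise is added to the log-weights and the result is passed through a softmax with a temperature. The names `sample_gumbel`, `sample_concrete`, `log_alpha`, and `temperature` are illustrative choices, not identifiers taken from the paper.

```python
import math
import random


def sample_gumbel(rng):
    """Draw one Gumbel(0, 1) variate from a uniform draw (the fixed-distribution noise)."""
    u = rng.random()
    # random() returns values in [0, 1); redraw on exactly 0 so log(u) is defined.
    while u == 0.0:
        u = rng.random()
    return -math.log(-math.log(u))


def sample_concrete(log_alpha, temperature, rng):
    """Relaxed one-hot sample: softmax((log_alpha + Gumbel noise) / temperature)."""
    logits = [(la + sample_gumbel(rng)) / temperature for la in log_alpha]
    m = max(logits)  # subtract the max before exponentiating for numerical stability
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Assumed example parameters: unnormalized class weights 0.2, 0.3, 0.5.
rng = random.Random(0)
log_alpha = [math.log(0.2), math.log(0.3), math.log(0.5)]
relaxed = sample_concrete(log_alpha, temperature=0.5, rng=rng)
```

In this sketch the randomness enters only through the Gumbel draws, so the output is a smooth function of `log_alpha` for any fixed noise. Lowering `temperature` pushes samples toward the one-hot vertices, and the index of the largest coordinate follows the discrete distribution with probabilities proportional to the weights.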

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1611.00712 [cs.LG]
	(or arXiv:1611.00712v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1611.00712

Submission history

From: Chris J. Maddison
[v1] Wed, 2 Nov 2016 18:25:40 UTC (662 KB)
[v2] Sun, 6 Nov 2016 23:25:23 UTC (440 KB)
[v3] Sun, 5 Mar 2017 16:59:44 UTC (998 KB)
