TY - JOUR
T1 - Split Hamiltonian Monte Carlo
AU - Shahbaba, Babak
AU - Lan, Shiwei
AU - Johnson, Wesley O.
AU - Neal, Radford M.
N1 - Funding Information:
Acknowledgements B. Shahbaba is supported by the National Science Foundation, Grant No. IIS-1216045. R.M. Neal’s work is supported by the Natural Sciences and Engineering Research Council of Canada. He holds a Canada Research Chair in Statistics and Machine Learning.
PY - 2014/5
Y1 - 2014/5
N2 - We show how the Hamiltonian Monte Carlo algorithm can sometimes be speeded up by "splitting" the Hamiltonian in a way that allows much of the movement around the state space to be done at low computational cost. One context where this is possible is when the log density of the distribution of interest (the potential energy function) can be written as the log of a Gaussian density, which is a quadratic function, plus a slowly-varying function. Hamiltonian dynamics for quadratic energy functions can be analytically solved. With the splitting technique, only the slowly-varying part of the energy needs to be handled numerically, and this can be done with a larger stepsize (and hence fewer steps) than would be necessary with a direct simulation of the dynamics. Another context where splitting helps is when the most important terms of the potential energy function and its gradient can be evaluated quickly, with only a slowly-varying part requiring costly computations. With splitting, the quick portion can be handled with a small stepsize, while the costly portion uses a larger stepsize. We show that both of these splitting approaches can reduce the computational cost of sampling from the posterior distribution for a logistic regression model, using either a Gaussian approximation centered on the posterior mode, or a Hamiltonian split into a term that depends on only a small number of critical cases, and another term that involves the larger number of cases whose influence on the posterior distribution is small.
AB - We show how the Hamiltonian Monte Carlo algorithm can sometimes be speeded up by "splitting" the Hamiltonian in a way that allows much of the movement around the state space to be done at low computational cost. One context where this is possible is when the log density of the distribution of interest (the potential energy function) can be written as the log of a Gaussian density, which is a quadratic function, plus a slowly-varying function. Hamiltonian dynamics for quadratic energy functions can be analytically solved. With the splitting technique, only the slowly-varying part of the energy needs to be handled numerically, and this can be done with a larger stepsize (and hence fewer steps) than would be necessary with a direct simulation of the dynamics. Another context where splitting helps is when the most important terms of the potential energy function and its gradient can be evaluated quickly, with only a slowly-varying part requiring costly computations. With splitting, the quick portion can be handled with a small stepsize, while the costly portion uses a larger stepsize. We show that both of these splitting approaches can reduce the computational cost of sampling from the posterior distribution for a logistic regression model, using either a Gaussian approximation centered on the posterior mode, or a Hamiltonian split into a term that depends on only a small number of critical cases, and another term that involves the larger number of cases whose influence on the posterior distribution is small.
KW - Bayesian analysis
KW - Hamiltonian dynamics
KW - Markov chain Monte Carlo
UR - http://www.scopus.com/inward/record.url?scp=84898540704&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84898540704&partnerID=8YFLogxK
U2 - 10.1007/s11222-012-9373-1
DO - 10.1007/s11222-012-9373-1
M3 - Article
AN - SCOPUS:84898540704
SN - 0960-3174
VL - 24
SP - 339
EP - 349
JO - Statistics and Computing
JF - Statistics and Computing
IS - 3
ER -