Nutpie: state-of-the-art mass matrix adaptation for HMC

· Source: Statistical Modeling, Causal Inference, and Social Science · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, short

Summary

The new "Nutpie sampler" significantly outperforms Stan's and PyMC's samplers, typically being twice as fast, primarily due to superior mass-matrix adaptation within the Hamiltonian Monte Carlo (HMC) framework. This improvement stems from better initialization using gradients at the initial point and leveraging two sources of information: the position and the gradient of the log density. The *arXiv* paper, "Preconditioning Hamiltonian Monte Carlo by minimizing Fisher Divergence," details the algorithm, which also offers a low-rank plus diagonal approximation for decorrelating hierarchical models. The geometric average of the covariance of draws and the inverse covariance of scores minimizes Fisher divergence, leading to a better diagonal preconditioner. This development, led by Adrian Seyboldt, Eliot L. Carlson, and Bob Carpenter, represents a notable advancement in computational statistics.

Key takeaway

Nutpie accelerates Hamiltonian Monte Carlo (HMC) by up to 2x over Stan and PyMC through novel diagonal mass-matrix adaptation. This method uses gradient-based initialization and geometrically averages position and gradient information to minimize Fisher divergence. It also supports low-rank plus diagonal approximations, reducing N^3 operations to K^2*N for efficient decorrelation of hierarchical models.

Topics

Best for: AI Scientist, AI Researcher, Data Scientist, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Statistical Modeling, Causal Inference, and Social Science.