The surrogate Gibbs-posterior of a corrected stochastic MALA: Towards uncertainty quantification for neural networks
Summary
Sebastian Bieringer, Gregor Kasieczka, Maximilian F. Steffen, and Mathias Trabs introduce a corrected stochastic MALA (csMALA) method designed to improve uncertainty quantification in neural networks. While standard stochastic MALA (sMALA) scales to large datasets, it samples from a surrogate posterior that deviates from the true Gibbs-posterior due to reduced sample size. The proposed csMALA incorporates a simple correction term that reduces the distance between its surrogate posterior and the original Gibbs-posterior as the full sample size increases, all while maintaining scalability. The authors prove a PAC-Bayes oracle inequality for the surrogate posterior in a nonparametric regression model and demonstrate its application to Bayesian neural networks. They analyze credible ball diameter and coverage for shallow networks and show optimal contraction rates for deep networks, confirming these practical advantages in a high-dimensional simulation study.
Key takeaway
For research scientists developing Bayesian neural networks, understanding the implications of csMALA is crucial. This method offers a scalable approach to more accurately quantify uncertainties by bringing the surrogate posterior closer to the true Gibbs-posterior. You should consider integrating csMALA into your workflow to achieve more reliable credible intervals and contraction rates, especially when working with large datasets where standard sMALA might introduce significant approximation errors.
Key insights
Corrected stochastic MALA (csMALA) improves Gibbs-posterior approximation for scalable uncertainty quantification in neural networks.
Principles
- Correction terms can align surrogate posteriors with true posteriors.
- Uncertainty quantification is possible via surrogate posterior sampling.
Method
csMALA introduces a simple correction term to standard sMALA, reducing the distance between the surrogate posterior and the Gibbs-posterior as the full sample size increases, thereby improving approximation quality while retaining scalability.
In practice
- Quantify uncertainties by sampling from the csMALA surrogate posterior.
- Apply csMALA to Bayesian neural networks for improved credibility.
Topics
- Corrected Stochastic MALA
- Gibbs-posterior Distribution
- Uncertainty Quantification
- Bayesian Neural Networks
- Markov Chain Monte Carlo
Code references
Best for: Research Scientist, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by JMLR.