Beyond Expected Information Gain: Stable Bayesian Optimal Experimental Design with Integral Probability Metrics and Plug-and-Play Extensions
Summary
This work introduces a novel Bayesian Optimal Experimental Design (BOED) framework that replaces traditional Kullback-Leibler (KL) divergence-based expected information gain (EIG) with Integral Probability Metrics (IPMs). Classical EIG methods suffer from computational challenges due to nested expectations and inherent sensitivity to support mismatch, tail underestimation, and rare events. The proposed IPM-based framework, utilizing metrics like Wasserstein distance, Maximum Mean Discrepancy (MMD), and Energy Distance, offers enhanced stability under surrogate-model error and prior misspecification. Theoretical guarantees demonstrate that IPM-based utilities provide stronger geometry-aware stability. Empirical validation, including A/B testing and preference learning models, shows that IPMs yield more concentrated credible sets, improved computational efficiency (e.g., W1-based utility takes 0.3 seconds per design versus 14 seconds for KL), and more robust optimization landscapes. The framework also supports plug-and-play extensions to high-dimensional settings using neural optimal transport estimators, outperforming nested Monte Carlo and variational methods in 64-dimensional linear-Gaussian and 32-dimensional sign-ambiguous bimodal models.
Key takeaway
For AI Scientists and Machine Learning Engineers designing experiments in resource-constrained or high-dimensional settings, adopting Integral Probability Metrics (IPMs) for Bayesian Optimal Experimental Design is crucial. This approach offers significantly more stable utility landscapes and computational efficiency compared to traditional KL-divergence methods, reducing sensitivity to model errors and prior misspecification. Your experimental designs will be more robust, and optimization less prone to local minima, especially when integrating advanced neural estimators for complex posterior geometries.
Key insights
IPM-based BOED offers superior stability and computational efficiency over traditional KL-divergence methods, especially in high-dimensional settings.
Principles
- IPMs provide geometry-aware stability, avoiding density-ratio sensitivity.
- Bounded IPMs offer unconditional, uniform L1 control over utility errors.
- Prior approximations are naturally justified by IPMs operating on probability measures.
Method
The framework defines design utility via Integral Probability Metrics between prior and posterior distributions, enabling stable sample-based estimation without explicit density ratios and supporting plug-and-play neural estimators for high dimensions.
In practice
- Use IPMs for BOED to mitigate rare-event sensitivity.
- Employ neural optimal transport for high-dimensional utility estimation.
- Expect broader near-optimal design regions with IPM-based objectives.
Topics
- Bayesian Optimal Experimental Design
- Integral Probability Metrics
- Neural Optimal Transport
- Kullback-Leibler Divergence
- Stability Analysis
Best for: AI Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.