Tokenised Flow Matching for Hierarchical Simulation Based Inference

· Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Mathematics & Computational Sciences · Depth: Expert, quick

Summary

Simulation Based Inference (SBI) faces a significant bottleneck due to the high cost of simulator evaluations, particularly in hierarchical models with shared global parameters and exchangeable site-level parameters. Existing hierarchical SBI methods improve efficiency by factorizing the posterior but still require multi-site simulations per training sample. A new approach, likelihood factorisation (LF), trains from single-site simulations by learning a per-site neural simulator surrogate and then assembling synthetic multi-site observations for amortized inference of the full hierarchical posterior. Building on this, Tokenised Flow Matching for Posterior Estimation (TFMPE) is proposed, which uses a tokenised flow matching technique to support function-valued observations via LF. The authors introduce a new benchmark for hierarchical SBI and validate TFMPE on it, as well as on infectious disease and computational fluid dynamics models, demonstrating well-calibrated posteriors and reduced computational cost.

Key takeaway

For AI Scientists and Machine Learning Engineers working with hierarchical Simulation Based Inference, consider adopting Tokenised Flow Matching for Posterior Estimation (TFMPE) to significantly reduce computational costs. Your teams can achieve well-calibrated posteriors even with complex, function-valued observations by leveraging likelihood factorisation and single-site simulations, thereby accelerating model development and deployment in resource-intensive domains like infectious disease modeling or computational fluid dynamics.

Key insights

Likelihood factorisation and tokenised flow matching significantly reduce simulation costs in hierarchical SBI.

Principles

Method

TFMPE learns per-site neural surrogates via likelihood factorisation, then uses tokenised flow matching to support function-valued observations for hierarchical posterior estimation.

In practice

Topics

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.