Bench-MFG: A Benchmark Suite for Learning in Stationary Mean Field Games
Summary
Bench-MFG is a new benchmark suite designed to standardize the evaluation of learning algorithms in stationary Mean Field Games (MFGs), addressing the current fragmentation in research. The suite introduces a taxonomy of problem classes, including No-Interaction, Contractive, Lasry-Lions Monotone, Potential, and Dynamics-Coupled games, providing prototypical environments for each. It also features MF-Garnets, a method for procedurally generating random MFG instances to enable robust statistical testing. Researchers benchmarked various learning algorithms, such as Fixed Point Iteration, Policy Iteration, Fictitious Play, Online Mirror Descent (OMD), and a novel black-box exploitability minimizer called MF-PSO, across these diverse environments. The implementation leverages JAX for a 2000x speedup compared to classical Python, with code available on GitHub. Empirical results inform guidelines for future experimental comparisons in the field.
Key takeaway
For research scientists developing or evaluating Mean Field Game algorithms, you should adopt the Bench-MFG suite to ensure robust and reproducible comparisons. This means testing your algorithms across the provided taxonomy of MFG classes and utilizing MF-Garnets for statistical validation. Prioritize computationally efficient implementations, such as those in JAX, to facilitate exhaustive hyperparameter tuning and comprehensive regime testing, thereby strengthening the empirical rigor of your findings.
Key insights
Bench-MFG standardizes Mean Field Game evaluation through a diverse benchmark suite and procedural generation.
Principles
- MFG evaluation requires diverse problem classes.
- Procedural generation enables rigorous statistical testing.
- Exploitability minimization is a valid objective.
Method
Bench-MFG defines a taxonomy of MFG problem classes, provides prototypical environments, and introduces MF-Garnets for generating random instances. It benchmarks algorithms like MF-PSO by minimizing exploitability.
In practice
- Use JAX for significant computational speedups.
- Compare against Fixed Point, Fictitious Play, OMD, and MF-PSO.
- Test algorithms across diverse MFG regimes and parameters.
Topics
- Mean Field Games
- Reinforcement Learning
- Benchmark Suite
- Multi-Agent Systems
- MF-PSO
Code references
Best for: Research Scientist, AI Researcher, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.MA updates on arXiv.org.