On the Robustness of Kernel Goodness-of-Fit Tests
Summary
A new study by Xing Liu and François-Xavier Briol, published in 2025, addresses the robustness of kernel goodness-of-fit (GoF) tests, which are often criticized for rejecting models as sample sizes increase, given that "all models are wrong." The research demonstrates that current kernel GoF tests lack both qualitative and quantitative robustness under common definitions. Furthermore, the authors show that robustification methods like tilted kernels, effective in parameter estimation, do not adequately ensure robustness in the testing context. To overcome these limitations, the paper introduces the first robust kernel GoF test, which leverages kernel Stein discrepancy (KSD) balls. This novel framework is capable of encompassing various established perturbation models, including Huber's contamination and density-band models, resolving a long-standing open problem in the field.
Key takeaway
For AI Researchers and Statisticians evaluating probabilistic models, recognize that standard kernel goodness-of-fit tests are not robust and will likely reject even "good enough" models with sufficient data. You should consider adopting the new robust kernel GoF test based on kernel Stein discrepancy (KSD) balls to more accurately assess if your model is a mild perturbation of the true data-generating process, ensuring practical relevance in your model validation efforts.
Key insights
Existing kernel goodness-of-fit tests lack robustness, necessitating a new framework for practical model validation.
Principles
- "All models are wrong" implies GoF tests always reject with large samples.
- Robustness is critical for practical GoF testing.
- Tilted kernels are insufficient for GoF test robustness.
Method
The proposed robust kernel goodness-of-fit test utilizes kernel Stein discrepancy (KSD) balls to handle various perturbation models, including Huber's contamination and density-band models.
In practice
- Apply KSD balls for robust GoF testing.
- Consider Huber's contamination for perturbation models.
Topics
- Goodness-of-Fit Testing
- Kernel Methods
- Robustness
- Kernel Stein Discrepancy
- Perturbation Models
Code references
Best for: AI Researcher, AI Scientist, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by JMLR.