On the Robustness of Kernel Goodness-of-Fit Tests

· Source: JMLR · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, quick

Summary

A new study by Xing Liu and François-Xavier Briol, published in 2025, addresses the robustness of kernel goodness-of-fit (GoF) tests, which are often criticized for rejecting models as sample sizes increase, given that "all models are wrong." The research demonstrates that current kernel GoF tests lack both qualitative and quantitative robustness under common definitions. Furthermore, the authors show that robustification methods like tilted kernels, effective in parameter estimation, do not adequately ensure robustness in the testing context. To overcome these limitations, the paper introduces the first robust kernel GoF test, which leverages kernel Stein discrepancy (KSD) balls. This novel framework is capable of encompassing various established perturbation models, including Huber's contamination and density-band models, resolving a long-standing open problem in the field.

Key takeaway

For AI Researchers and Statisticians evaluating probabilistic models, recognize that standard kernel goodness-of-fit tests are not robust and will likely reject even "good enough" models with sufficient data. You should consider adopting the new robust kernel GoF test based on kernel Stein discrepancy (KSD) balls to more accurately assess if your model is a mild perturbation of the true data-generating process, ensuring practical relevance in your model validation efforts.

Key insights

Existing kernel goodness-of-fit tests lack robustness, necessitating a new framework for practical model validation.

Principles

Method

The proposed robust kernel goodness-of-fit test utilizes kernel Stein discrepancy (KSD) balls to handle various perturbation models, including Huber's contamination and density-band models.

In practice

Topics

Code references

Best for: AI Researcher, AI Scientist, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by JMLR.