On the Robustness of Kernel Goodness-of-Fit Tests

2024-12-31 · Source: JMLR · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, quick

Summary

A new study by Xing Liu and FranÃ§ois-Xavier Briol, published in 2025, addresses the robustness of kernel goodness-of-fit (GoF) tests, which are often criticized for rejecting models as sample sizes increase, given that "all models are wrong." The research demonstrates that current kernel GoF tests lack both qualitative and quantitative robustness under common definitions. Furthermore, the authors show that robustification methods like tilted kernels, effective in parameter estimation, do not adequately ensure robustness in the testing context. To overcome these limitations, the paper introduces the first robust kernel GoF test, which leverages kernel Stein discrepancy (KSD) balls. This novel framework is capable of encompassing various established perturbation models, including Huber's contamination and density-band models, resolving a long-standing open problem in the field.

Key takeaway

For AI Researchers and Statisticians evaluating probabilistic models, recognize that standard kernel goodness-of-fit tests are not robust and will likely reject even "good enough" models with sufficient data. You should consider adopting the new robust kernel GoF test based on kernel Stein discrepancy (KSD) balls to more accurately assess if your model is a mild perturbation of the true data-generating process, ensuring practical relevance in your model validation efforts.

Key insights

Existing kernel goodness-of-fit tests lack robustness, necessitating a new framework for practical model validation.

Principles

"All models are wrong" implies GoF tests always reject with large samples.
Robustness is critical for practical GoF testing.
Tilted kernels are insufficient for GoF test robustness.

Method

The proposed robust kernel goodness-of-fit test utilizes kernel Stein discrepancy (KSD) balls to handle various perturbation models, including Huber's contamination and density-band models.

In practice

Apply KSD balls for robust GoF testing.
Consider Huber's contamination for perturbation models.

Topics

Goodness-of-Fit Testing
Kernel Methods
Robustness
Kernel Stein Discrepancy
Perturbation Models

Code references

XingLLiu/robust-kernel-test

Best for: AI Researcher, AI Scientist, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by JMLR.