Kernel Selection is Model Selection: A Unified Complexity-Penalized Approach for MMD Two-Sample Tests
Summary
The paper "Kernel Selection is Model Selection: A Unified Complexity-Penalized Approach for MMD Two-Sample Tests" introduces Complexity-Penalized MMD (CP-MMD), a novel criterion for nonparametric two-sample testing that addresses the critical challenge of data-driven kernel selection. Existing Maximum Mean Discrepancy (MMD) methods either overfit due to violating i.i.d. assumptions or cannot scale to continuous kernel search spaces like deep kernels. CP-MMD reframes kernel selection as a model selection problem, applying a two-sample uniform concentration inequality to derive a penalty term that accounts for optimization complexity. This enables grid-free maximization over continuous parametric classes, including scalar bandwidths, polynomial features, and deep network parameters. The method is proven to maximize true test power while ensuring unconditional Type-I validity, outperforming state-of-the-art methods like the median heuristic, MMDAgg, and Liu's ratio criterion across linear, polynomial-feature, and deep kernel regimes, and demonstrating $B$-free deployment costs.
Key takeaway
For AI Engineers and Research Scientists developing or deploying two-sample tests, CP-MMD offers a robust and efficient solution for data-driven kernel selection. Its ability to perform grid-free optimization over continuous kernel spaces, including deep neural networks, resolves critical overfitting and scalability issues inherent in prior methods. You should consider integrating CP-MMD to maximize test power and ensure Type-I validity, especially when working with complex, high-dimensional data or deep kernel architectures, as it significantly reduces deployment costs compared to aggregation methods.
Key insights
CP-MMD unifies kernel selection as a complexity-penalized model selection problem for robust two-sample testing.
Principles
- Kernel selection is a model selection problem.
- Data-driven kernel optimization requires complexity penalization.
- Uniform concentration inequalities can bound post-optimization MMD.
Method
CP-MMD maximizes empirical MMD minus a calibrated complexity penalty $\widehat{C}_1\cdot\widetilde{G}(h)$, where $\widetilde{G}(h)$ is a spectral-norm bound on kernel search space complexity, enabling grid-free optimization.
In practice
- Use CP-MMD for continuous kernel search spaces.
- Calibrate $\widehat{C}_1$ using null permutations for optimal performance.
- CP-MMD offers $B$-free deployment cost for deep kernels.
Topics
- Maximum Mean Discrepancy
- Kernel Selection
- Two-Sample Testing
- Complexity-Penalized MMD
- Uniform Concentration Inequality
Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, Data Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.