Counterfactual Explanations for Deep Two-Sample Testing

2026-06-12 · Source: stat.ML updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, extended

Summary

A counterfactual explanation framework for deep two-sample testing is proposed to address the limited insight provided by classical and deep two-sample tests into features driving distributional differences in high-dimensional structured data like images. The method combines a diffusion autoencoder with a pretrained deep two-sample test model, optimizing a Maximum Mean Discrepancy (MMD) objective in the test model's representation space. This generates sample-level edits that move observations from a source group toward a target group, explicitly reducing the measured discrepancy. Evaluation on synthetic 2D shape datasets (dSprites) and two MRI cohorts (ADNI, UK Biobank) shows that counterfactual transformations consistently increase p-values, indicating statistical closeness to the target distribution. For MRI, specifically in the CDR setting, the method increased mean Δp from 1.3e-4 to 8.65e-3 and mean Δt from 55.91 to 176.40 at λ=4, with an LPIPS of 0.1157. The resulting edits provide interpretable evidence, such as ventricular enlargement in MRI, consistent with known anatomical differences.

Key takeaway

For AI Scientists and Research Scientists analyzing high-dimensional data like medical images for group differences, this counterfactual explanation framework offers a crucial tool for interpretability. You should consider integrating deep-test-guided counterfactual editing to not only detect but also visually explain which specific features drive observed distributional shifts. This approach provides actionable insights into cohort differences, moving beyond mere p-values to reveal anatomically or semantically meaningful patterns, especially when optimizing for statistical effectiveness with a regularization weight like λ=4.

Key insights

Counterfactual explanations for deep two-sample tests reveal features driving distributional differences by generating minimal, discrepancy-reducing sample edits.

Principles

Optimize MMD in latent space for discrepancy reduction.
LPIPS regularization balances alignment and source similarity.
Statistical effectiveness, minimality, and realism involve trade-offs.

Method

The method encodes a source image into a semantic latent representation using a diffusion autoencoder, then perturbs this latent code by optimizing a DMMD-based counterfactual loss, guided by a pretrained deep two-sample test model, and finally decodes the modified latent code to generate the counterfactual image.

In practice

Apply to medical imaging for cohort difference analysis.
Use λ=4 for strong statistical performance in MRI.
Evaluate counterfactuals using Δp, Δt, LPIPS, and PRDC.

Topics

Counterfactual Explanations
Deep Two-Sample Testing
Diffusion Autoencoders
Medical Imaging
MRI Analysis
Distribution Shift Interpretation

Best for: AI Scientist, Research Scientist

Related on AIssential

See Counsel's argued verdicts on the open AI decisions leaders are weighing →

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.