SHIFT: Robust Double Machine Learning for Average Dose-Response Functions under Heavy-Tailed Contamination
Summary
SHIFT (Self-calibrated Heavy-tail Inlier-Fit with Tempering) is a robust Double Machine Learning (DML) estimator designed to address "functional smearing" in Average Dose-Response Function (ADRF) estimation, particularly under heavy-tailed contamination. Traditional DML pipelines using kernel-weighted local-linear smoothers suffer from unbounded functional influence, where a single outlier can bias the entire dose-response curve. SHIFT combines cross-fit nuisance orthogonalization with a kernel-local Welsch-loss second stage, optimized via Graduated Non-Convexity (GNC). Its key innovation is a defensive Ordinary Least Squares (OLS) refit that uses the Median Absolute Deviation (MAD) of post-GNC residuals for its inlier cutoff, rather than the raw-outcome MAD. This architectural choice reduces level-RMSE from 1.03 to 0.33 on localized-contamination stress tests at $p=0.25$, while maintaining performance on clean data. SHIFT also provides a non-uniform per-sample weight vector, achieving mean F1-scores of approximately 0.96 for outlier detection on Gaussian-jump DGPs. The estimator is paired with a six-technique Extreme Value Theory (EVT) diagnostic suite to help practitioners characterize tail behavior and choose appropriate robust methods.
Key takeaway
For AI Engineers and Research Scientists working on causal inference with continuous treatments, SHIFT offers a robust solution for Average Dose-Response Function estimation, especially when dealing with contaminated observational data. You should integrate SHIFT into your DML pipelines to improve shape recovery and gain a native outlier mask, particularly in scenarios with localized contamination. Additionally, leverage the accompanying EVT diagnostic suite to empirically understand the tail characteristics of your data, guiding your choice between SHIFT, Huber-DML, or Quantile-DML for optimal performance.
Key insights
SHIFT robustifies DML for ADRF by using post-GNC residual MAD for outlier detection, improving accuracy and providing an outlier mask.
Principles
- Redescending M-estimators mitigate unbounded functional influence.
- Post-GNC residual MAD improves outlier detection in localized contamination.
- EVT diagnostics inform robust estimator selection based on tail characteristics.
Method
SHIFT employs cross-fit nuisance orthogonalization, kernel-local GNC with Welsch loss, and a defensive OLS refit using post-GNC residual MAD for inlier cutoffs.
In practice
- Use SHIFT for robust ADRF estimation with outlier detection.
- Employ EVT diagnostics to classify tail behavior (e.g., Fréchet vs. Weibull).
- Consider linear nuisance models for robust DML under uniform contamination.
Topics
- Robust Double Machine Learning
- Average Dose-Response Function
- Heavy-Tailed Contamination
- Graduated Non-Convexity
- Extreme Value Theory Diagnostics
Code references
Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, Data Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.