Inverse Probability Weighting: Reweighting Instead of Discarding
Summary
Inverse Probability Weighting (IPW), the focus of Part 5 in an Applied Causal Inference series, is analyzed using the LaLonde job training dataset, which has a known true effect of \$1,794. Unlike Propensity Score Matching (PSM) which discarded 102 of 185 treated workers, IPW keeps all treated units and reweights control units to resemble the treated group. On the CPS-3 control group, IPW yielded an Average Treatment Effect on the Treated (ATT) of \$1,212, with a 95% CI of [−\$141, \$2,718]. This estimate is substantially closer to the benchmark and has a narrower interval than PSM's \$123–\$468. However, IPW on the full CPS group (15,992 workers) produced a wrong-signed ATT of −\$1,021, demonstrating failure when overlap is poor. The article also notes that AIPW, a doubly-robust estimator, performed worse than raw IPW on CPS-3, yielding \$476, due to misspecified outcome regression models. Effective Sample Size (ESS) is highlighted as the critical diagnostic for IPW.
Key takeaway
For data scientists or causal inference practitioners evaluating treatment effects, Inverse Probability Weighting (IPW) offers a powerful alternative to Propensity Score Matching. You should prioritize assessing covariate overlap using Effective Sample Size (ESS) before interpreting IPW results, as poor overlap leads to meaningless estimates. If using doubly-robust methods like AIPW, ensure your outcome regression model is well-specified for your data distribution, especially with skewed outcomes, to avoid adding noise.
Key insights
Inverse Probability Weighting improves causal estimates by reweighting controls, but requires good covariate overlap and correctly specified models.
Principles
- IPW's power comes from increased effective sample size.
- Overlap is the binding constraint for both IPW and PSM.
- Doubly-robust methods need at least one correct model.
Method
IPW uses logistic regression to compute propensity scores, then reweights control units (e(X)/(1−e(X)) for ATT) to match the treated group, keeping treated units at weight 1.
In practice
- Use Effective Sample Size (ESS) as IPW's primary diagnostic.
- Invest in flexible nonlinear outcome models for AIPW.
- Avoid IPW when propensity model AUC is near 1.0.
Topics
- Inverse Probability Weighting
- Causal Inference
- Propensity Score Matching
- Doubly Robust Estimation
- LaLonde Dataset
- Effective Sample Size
Code references
Best for: Data Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Data Science on Medium.