Strategic Feature Selection
Summary
A new study, "Strategic Feature Selection" by Kaur et al., formally investigates strategic classification using feature selection and ridge regularization, focusing on high-stakes domains like healthcare. It reveals that excluding individual features based solely on their manipulability is generally suboptimal. The research provides a detailed characterization of feature subset performance under optimal regularization, offering new insights for policy design. A practical algorithm is introduced for jointly selecting the feature set and the ridge regularization level. Through a real-world case study on a healthcare payments benchmark, simulating Medicare Advantage "upcoding" projected to cost \$40 billion in 2025, the algorithm effectively guides coarse policy levers, significantly enhancing strategic robustness while maintaining predictive accuracy.
Key takeaway
For policymakers designing algorithmic decision-making systems in high-stakes domains like healthcare, you should move beyond simply excluding highly manipulable features. Instead, jointly optimize feature selection with regularization, considering both a feature's predictability and its relative manipulability. Prioritize retaining predictive, manipulable feature groups if their manipulation costs are homogeneous, or seek less manipulable, correlated proxies to maintain predictive value while reducing strategic vulnerability. This approach offers a principled framework for mitigating strategic behavior.
Key insights
Excluding features based on manipulability alone is suboptimal; joint consideration of manipulability and predictability is key.
Principles
- Feature selection and regularization must be tuned jointly.
- Predictive manipulable groups can be retained if costs are homogeneous.
- Less manipulable correlated proxies can replace manipulable features.
Method
A two-stage procedure performs continuous relaxation of combinatorial support selection, followed by local support refinement, then joint optimization of feature set and regularization.
In practice
- Use targeted audits to equalize manipulation costs on features.
- Employ correlated alternative features as less manipulable proxies.
Topics
- Strategic Classification
- Feature Selection
- Ridge Regularization
- Healthcare Payments
- Medicare Advantage
- Algorithmic Decision-Making
Code references
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Policy Maker
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.