Audited Conformal Prediction for Classification under Unknown Distribution Shift
Summary
Audited Conformal Prediction (ACP) is a novel method for uncertainty quantification in classification models deployed under unknown distribution shifts. It addresses the degradation of predictive uncertainty estimates by integrating an auxiliary "audit" model, trained on a small labeled dataset from the target population, to identify inputs where the legacy model is likely to fail. ACP produces prediction sets that guarantee marginal coverage while significantly improving conditional coverage compared to existing approaches. The method offers two integration strategies: one for improved conditional performance with marginal coverage, and another for explicit group-conditional coverage guarantees. Experiments on synthetic data (e.g., K=5 classes, 100 features, 10,000 historical samples, 200-5,000 calibration samples) and real-world datasets like Camelyon17 and CIFAR-10/CIFAR-10-C demonstrate ACP's ability to balance reliability and efficiency, maintaining compact prediction set sizes while enhancing coverage for unreliable samples.
Key takeaway
For MLOps engineers deploying classification models in dynamic environments, Audited Conformal Prediction (ACP) offers a robust solution to maintain reliable uncertainty estimates under distribution shift. You should consider implementing ACP to improve conditional coverage for challenging samples without excessively inflating prediction set sizes. This approach allows you to adapt models efficiently with limited new labeled data, enhancing model trustworthiness and operational stability.
Key insights
Audited Conformal Prediction improves uncertainty quantification under distribution shift by using an auxiliary model to identify legacy model failures.
Principles
- An audit model can predict legacy model reliability without retraining the main model.
- Integrating audit model outputs into conformal prediction enhances conditional coverage.
- Data splitting is crucial for maintaining exchangeability and valid inference.
Method
Train a binary audit model îr on ij´ç²ç¹ to predict legacy model correctness. Integrate îr's output with the legacy model's predictions to construct conformal prediction sets on ij´ç²ç², ensuring marginal or group-conditional coverage.
In practice
- Use legacy model logits, confidence patterns, or intermediate features as audit model inputs.
- Employ random forests or neural networks for the audit model architecture.
Topics
- Conformal Prediction
- Distribution Shift
- Uncertainty Quantification
- Audit Models
- Classification Models
- Machine Learning Reliability
Code references
Best for: Research Scientist, Computer Vision Engineer, AI Scientist, Machine Learning Engineer, MLOps Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.