DRIFT: From Robustness Gaps to Invariance Manifolds for AI-Generated Image Detection
Summary
DRIFT is a novel method for detecting AI-generated images, addressing the limitations of current detectors in open-world scenarios with unseen generative models. Unlike existing training-free approaches that rely on fixed invariance geometry in frozen vision foundation models (VFMs), DRIFT formulates detection as learning a structured invariance manifold of real images under one-class supervision. It integrates lightweight projection heads onto a frozen VFM, partitioning the representation space into robust and fragile subspaces. The robust subspace is explicitly trained to suppress variations from physically plausible imaging transformations, approximating tangent directions of a real-image manifold, while the fragile subspace maintains sensitivity to edit-like perturbations. A structured ordering margin enforces hierarchical separation, enabling detection via a margin-violation test. At inference, multi-scale patch-wise drift provides a dual-channel invariance signature and interpretable localization, demonstrating superior open-world generalization and outperforming robustness-based baselines.
Key takeaway
For Computer Vision Engineers developing AI-generated image detection systems, especially in open-world settings with evolving generative models, you should consider adopting manifold-learning approaches like DRIFT. This method provides superior open-world generalization and interpretable localization compared to robustness-gap baselines, offering a more reliable strategy for identifying synthetic content. Implementing such a system could significantly enhance your detection capabilities against novel AI generators.
Key insights
AI-generated image detection can be framed as learning a real-image invariance manifold to identify margin violations.
Principles
- Decompose VFM representation space into robust and fragile subspaces.
- Train robust subspaces to approximate real-image manifold tangents.
- Enforce hierarchical separation between physical invariance and edit variability.
Method
DRIFT uses lightweight projection heads on a frozen VFM, training a robust subspace for physical invariance and a fragile subspace for edit sensitivity, detecting fakes via a structured ordering margin violation.
In practice
- Achieve strong open-world generalization across unseen image generators.
- Generate interpretable localization maps showing invariance violations.
Topics
- AI-Generated Image Detection
- Invariance Manifolds
- Vision Foundation Models
- Representation Learning
- Open-World Generalization
- Deepfake Detection
Best for: Research Scientist, AI Scientist, Computer Vision Engineer, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.