Architecture-Adaptive Uncertainty Fusion for Deepfake Detection
Summary
The Correlation-Optimized Fusion (COF) framework enhances deepfake detection by providing reliable prediction uncertainty, addressing the limitation of existing methods that ignore architecture-specific optimal uncertainty composition. COF fuses five complementary uncertainty sources—epistemic, aleatoric, calibration, conformal, and distributional—by maximizing Pearson correlation between fused scores and prediction errors through constrained optimization on the probability simplex. This architecture-adaptive approach requires no model modifications and optimizes weights in just 42 seconds, significantly faster than a 5-model Deep Ensemble's 20-45 hours. Evaluated across eleven architectures on FaceForensics++, CelebDF, and DFDC, COF achieves a mean in-domain Pearson ρ=0.438. Crucially, under distribution shift, COF outperforms Random Forest in 9/11 architectures on CelebDF with up to 7.3× higher correlation, retaining more signal (74% drop to ρ=0.116) compared to RF's 85% degradation. However, all UQ methods show catastrophic generalization failure cross-dataset, with mean correlation degrading 90.7% and seven architectures exhibiting uncertainty inversion.
Key takeaway
For AI Security Engineers deploying deepfake detectors in forensic contexts, prioritize COF or L1-COF for cross-domain reliability, as they offer significantly higher correlation under distribution shift compared to non-linear methods like Random Forest. You should prefer residual-CNN or larger EfficientNet architectures (e.g., ResNet50, EfficientNet-B4) to mitigate uncertainty inversion risks. Always include conformal prediction in your UQ systems and recognize that feature-space distance is the most stable uncertainty signal across domains. Be aware that all current UQ methods face catastrophic reliability collapse on highly shifted datasets.
Key insights
Correlation-Optimized Fusion (COF) adaptively combines multiple uncertainty sources to predict deepfake detection errors, improving cross-domain reliability.
Principles
- Optimal uncertainty fusion is architecture-dependent.
- Capacity-controlled fusion enhances cross-domain reliability.
- Feature-space distance offers stable cross-domain uncertainty.
Method
COF maximizes Pearson correlation between five normalized uncertainty sources and prediction errors via constrained optimization on the probability simplex using SLSQP, learning architecture-specific weights.
In practice
- Always include conformal prediction in UQ systems.
- Prioritize feature-space distance for cross-domain UQ.
- Use SC-Weight for resource-constrained UQ deployment.
Topics
- Deepfake Detection
- Uncertainty Quantification
- Correlation-Optimized Fusion
- Cross-Domain Generalization
- Conformal Prediction
- Machine Learning Forensics
Code references
Best for: Computer Vision Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, AI Security Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.