Architecture-Adaptive Uncertainty Fusion for Deepfake Detection

· Source: cs.CV updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy · Depth: Expert, extended

Summary

The Correlation-Optimized Fusion (COF) framework enhances deepfake detection by providing reliable prediction uncertainty, addressing the limitation of existing methods that ignore architecture-specific optimal uncertainty composition. COF fuses five complementary uncertainty sources—epistemic, aleatoric, calibration, conformal, and distributional—by maximizing Pearson correlation between fused scores and prediction errors through constrained optimization on the probability simplex. This architecture-adaptive approach requires no model modifications and optimizes weights in just 42 seconds, significantly faster than a 5-model Deep Ensemble's 20-45 hours. Evaluated across eleven architectures on FaceForensics++, CelebDF, and DFDC, COF achieves a mean in-domain Pearson ρ=0.438. Crucially, under distribution shift, COF outperforms Random Forest in 9/11 architectures on CelebDF with up to 7.3× higher correlation, retaining more signal (74% drop to ρ=0.116) compared to RF's 85% degradation. However, all UQ methods show catastrophic generalization failure cross-dataset, with mean correlation degrading 90.7% and seven architectures exhibiting uncertainty inversion.

Key takeaway

For AI Security Engineers deploying deepfake detectors in forensic contexts, prioritize COF or L1-COF for cross-domain reliability, as they offer significantly higher correlation under distribution shift compared to non-linear methods like Random Forest. You should prefer residual-CNN or larger EfficientNet architectures (e.g., ResNet50, EfficientNet-B4) to mitigate uncertainty inversion risks. Always include conformal prediction in your UQ systems and recognize that feature-space distance is the most stable uncertainty signal across domains. Be aware that all current UQ methods face catastrophic reliability collapse on highly shifted datasets.

Key insights

Correlation-Optimized Fusion (COF) adaptively combines multiple uncertainty sources to predict deepfake detection errors, improving cross-domain reliability.

Principles

Method

COF maximizes Pearson correlation between five normalized uncertainty sources and prediction errors via constrained optimization on the probability simplex using SLSQP, learning architecture-specific weights.

In practice

Topics

Code references

Best for: Computer Vision Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, AI Security Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.