Directional Confusions Reveal Divergent Inductive Biases Through Rate-Distortion Geometry in Human and Machine Vision
Summary
A new study reveals that while humans and deep vision models can achieve similar classification accuracy, they exhibit systematically different patterns of errors, specifically in "directional confusions." These directional confusions, which indicate who is mistaken for whom and in what direction, expose distinct inductive biases not captured by accuracy metrics alone. Researchers quantified asymmetry in confusion matrices using a Rate-Distortion (RD) framework, characterized by three geometric signatures: slope (beta), curvature (kappa), and efficiency (AUC). The findings indicate that humans display broad but weak asymmetries, whereas deep vision models show sparser, stronger directional collapses. Robustness training in models reduced global asymmetry but did not replicate the human-like breadth-strength profile of graded similarity. Mechanistic simulations further demonstrated that varying asymmetry organizations shift the RD frontier in opposing directions, even when performance is matched.
Key takeaway
For Computer Vision Engineers evaluating model robustness, you should move beyond scalar accuracy metrics to analyze directional confusions and Rate-Distortion geometry. This approach provides a more nuanced understanding of inductive biases and how models generalize under distribution shifts, helping you identify and mitigate specific failure modes that robustness training alone may not address effectively.
Key insights
Directional confusions and Rate-Distortion geometry reveal distinct inductive biases in human and machine vision.
Principles
- Accuracy alone obscures critical inductive bias differences.
- Asymmetry in confusions signals divergent generalization geometry.
Method
The Rate-Distortion (RD) framework, using slope (beta), curvature (kappa), and efficiency (AUC), quantifies confusion matrix asymmetry to characterize inductive bias under distribution shift.
In practice
- Analyze directional confusions beyond scalar accuracy.
- Use RD geometry to compare model robustness profiles.
Topics
- Directional Confusions
- Inductive Biases
- Rate-Distortion Geometry
- Human Vision
- Machine Vision
Best for: Computer Vision Engineer, AI Scientist, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.