A Dual Edge Spatial Jacobian Image Graph for Interpretable Diabetic Retinopathy Grading

· Source: stat.ML updates on arXiv.org · Field: Health & Wellbeing — Artificial Intelligence & Machine Learning, Health & Medical Research, Medical Devices & Health Technology · Depth: Expert, medium

Summary

A dual-edge spatial–Jacobian image graph is introduced for interpretable diabetic retinopathy (DR) grading from color fundus photographs. This framework represents each image as a graph node, integrating four aligned evidence streams: AutoMorph vessel information ($X_{1}$), DR-XAI-style lesion evidence maps ($X_{2}$), a 128-dimensional lesion-based contrastive image embedding ($X_{3}$), and AutoMorph morphometric biomarkers ($X_{4}$). The system constructs two distinct edge branches: a spatial branch ($X_{12}$) encoding vessel–lesion geometry and a Jacobian branch ($X_{34}$) modeling embedding–biomarker sensitivity. These branches are fused using a lightweight two-token attention module. Evaluated on 2,910 non-augmented APTOS images, the full graph achieved 0.8076 accuracy, 0.8312 quadratic weighted kappa, 0.5915 macro-F1, and 0.9330 adjacent-grade accuracy. For referable DR, it reached 0.9055 accuracy and 0.9711 AUROC. The framework is designed as an explainable representation-learning tool for lesion–biomarker hypothesis generation, rather than a deployment-ready clinical classifier.

Key takeaway

For AI Scientists developing diagnostic tools for ophthalmology, you should consider graph-based approaches that explicitly model multi-modal relationships. This framework demonstrates how integrating spatial vessel-lesion geometry and embedding-biomarker sensitivity can yield more interpretable DR grading, moving beyond simple image-level labels. Your focus should shift towards building explainable representations that facilitate hypothesis generation, rather than solely optimizing for raw predictive performance in clinical deployment.

Key insights

Interpretable DR grading benefits from graph representations linking spatial lesion-vessel geometry and embedding-biomarker sensitivity.

Principles

Method

Fundus images are represented as graph nodes with four evidence streams. Spatial and Jacobian edge branches are constructed and fused via two-token attention for DR grading.

In practice

Topics

Code references

Best for: Computer Vision Engineer, AI Scientist, Research Scientist, Domain Expert

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.