Propagating Structural Guidance: Synthesizing Fluorescein Angiography from Fundus Images and Sparse OCT Scans
Summary
A novel framework synthesizes Fluorescein Angiography (FFA) images from Color Fundus Photography (CFP) by incorporating structural guidance from Optical Coherence Tomography (OCT). This approach addresses limitations of prior CFP-to-FFA synthesis methods, which relied solely on surface texture and struggled to reconstruct functional vascular information. The researchers constructed the first tri-modally aligned retinal imaging dataset, comprising paired CFP, FFA, and OCT scans from 3,676 patient eyes. Key components include the Spatially Aligned Cross-Modal Fusion (SACMF) module, which projects depth-resolved OCT features onto the fundus plane and injects them into the CFP encoder, and Token-wise Cross-Modality Alignment (TCMA), a contrastive learning strategy aligning CFP and FFA representations. Published on 2026-06-15, the method demonstrates superior synthesis performance and significantly improves downstream disease diagnosis, highlighting its clinical potential as a non-invasive decision-support tool.
Key takeaway
For AI scientists developing non-invasive retinal diagnostic tools, you should consider integrating multi-modal data like OCT scans with fundus photography. This approach, demonstrated by the SACMF and TCMA modules, significantly improves synthesized FFA image quality and downstream disease diagnosis performance. Explore the provided code to adapt this structural guidance framework, enhancing your models' ability to detect subtle vascular abnormalities without invasive procedures.
Key insights
Integrating OCT structural guidance with CFP significantly enhances non-invasive FFA synthesis for improved retinal vascular assessment.
Principles
- Multi-modal data fusion enhances image synthesis.
- Structural guidance improves functional information reconstruction.
- Token-level alignment refines cross-modal representations.
Method
The framework uses a Spatially Aligned Cross-Modal Fusion (SACMF) module to project OCT features onto the fundus plane for CFP encoder injection, complemented by Token-wise Cross-Modality Alignment (TCMA) for spatial representation alignment.
In practice
- Develop non-invasive retinal disease diagnosis tools.
- Enhance FFA synthesis for clinical decision support.
- Utilize multi-modal datasets for medical imaging.
Topics
- Fluorescein Angiography Synthesis
- Retinal Imaging
- Multi-modal Fusion
- Optical Coherence Tomography
- Fundus Photography
- Medical Image Analysis
- Disease Diagnosis
Code references
Best for: Computer Vision Engineer, AI Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.