GTF: Omnidirectional EPI Transformer for Light Field Super-Resolution
Summary
GTF, an omnidirectional Epipolar Plane Image (EPI) Transformer, is introduced for light field (LF) image super-resolution, explicitly modeling horizontal, vertical, 45-degree, and 135-degree EPIs within a unified reconstruction framework. This approach addresses the underexplored diagonal epipolar geometry in existing Transformer-based LF SR methods. GTF integrates directional EPI processing, MacPI-based prior injection, adaptive directional fusion, and a topology-preserving feed-forward network. The main GTF model achieved 32.78 dB on five standard LF SR benchmarks, while a lightweight GTF-Tiny variant, designed for efficiency, reached 32.57 dB with 0.915M parameters and 19.81 GFLOPs. In the NTIRE 2026 Light Field Image Super-Resolution Challenge, GTF submissions ranked 3rd on Track 1 and Track 3, and 4th on Track 2.
Key takeaway
For research scientists developing light field super-resolution models, you should investigate incorporating omnidirectional EPI processing, including diagonal epipolar geometry, into your Transformer architectures. This approach, demonstrated by GTF, can yield substantial performance gains in fidelity and efficiency, particularly for competitive benchmarks like NTIRE challenges. Evaluate the trade-offs between model complexity and performance using variants like GTF-Tiny to meet specific resource constraints.
Key insights
Omnidirectional EPI modeling, including diagonal views, significantly enhances light field super-resolution performance.
Principles
- Explicitly model all EPI directions.
- Integrate prior knowledge via MacPI.
- Fuse directional features adaptively.
Method
GTF processes horizontal, vertical, 45-degree, and 135-degree EPIs, injects MacPI priors, adaptively fuses directional features, and uses a topology-preserving feed-forward network for LF SR.
In practice
- Consider diagonal EPIs for LF SR.
- Implement adaptive directional fusion.
- Utilize lightweight model variants for efficiency.
Topics
- Light Field Super-Resolution
- EPI Transformer
- Omnidirectional Epipolar Geometry
- NTIRE 2026 Challenge
- Transformer Architecture
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Computer Vision Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.