MapSR: Prompt-Driven Land Cover Map Super-Resolution via Vision Foundation Models

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Geospatial AI · Depth: Expert, quick

Summary

MapSR is a novel prompt-driven framework designed for land-cover map super-resolution, addressing the high cost associated with dense high-resolution (HR) annotations. This framework enhances coarse low-resolution (LR) land-cover products into HR maps at the resolution of input imagery. Unlike existing weakly supervised methods that retrain dense predictors, MapSR decouples supervision from model training. It extracts class prompts from frozen vision foundation model features using a lightweight linear probe, leveraging LR labels only once. High-resolution mapping then proceeds via training-free metric inference and graph-based prediction refinement. On the Chesapeake Bay dataset, MapSR achieved 59.64% mIoU without any HR labels, competing with strong weakly supervised baselines and outperforming a fully supervised baseline. Crucially, MapSR reduces trainable parameters by four orders of magnitude and shortens training time from hours to minutes.

Key takeaway

For AI Scientists and Computer Vision Engineers developing land-cover mapping solutions, MapSR offers a compelling approach to achieve high-resolution outputs with significantly reduced annotation and computational burdens. You should consider integrating prompt-driven frameworks like MapSR to accelerate development cycles and deploy scalable HR mapping under tight budget constraints, especially when dense HR labels are prohibitively expensive or unavailable.

Key insights

MapSR uses prompt-driven vision foundation models for efficient, training-free land-cover super-resolution with minimal annotation.

Principles

Method

MapSR extracts class prompts from frozen vision foundation model features via a linear probe, then performs HR mapping using training-free metric inference and graph-based prediction refinement.

In practice

Topics

Code references

Best for: AI Scientist, Computer Vision Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.