GeoDisaster: Benchmarking Orchestrated Agents for Operational Disaster Geo-Intelligence

2026-06-15 · Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Environmental Science & Earth Systems, Data Science & Analytics · Depth: Expert, quick

Summary

GeoDisaster is a new operational geospatial disaster reasoning benchmark designed to advance Earth-observation analysis beyond current remote-sensing vision-language models (RS-VLMs). It comprises 2,921 verified instances across 43 question types and five task families, including deforestation monitoring, multi-hazard analysis, building-damage assessment, flood-safe routing, and Sentinel-1 SAR flood monitoring. The benchmark integrates diverse Earth Observation/Geographic Information System (EO/GIS) evidence, such as optical and SAR imagery, raster masks, vector geometries, road networks, and exposure layers, covering hazard detection, damage assessment, exposure estimation, and diagnostic report generation. Ground-truth answers are derived from executable geospatial workflows and deterministic consistency checks, eliminating the need for language model annotation. Alongside GeoDisaster, a multi-agent framework with 18 disaster-oriented tools is introduced, where role-specialized agents coordinate using explicit execution contracts, aligned through Role-Contract Expectation Alignment (RCEA). Experiments demonstrate that GeoDisaster effectively challenges existing RS-VLMs and agentic systems, while RCEA significantly enhances tool use, evidence grounding, state consistency, and decision generation.

Key takeaway

For AI Scientists and Machine Learning Engineers developing operational geo-intelligence systems, GeoDisaster highlights the need for robust tool-grounded spatial reasoning. You should consider adopting multi-agent frameworks with explicit execution contracts and Role-Contract Expectation Alignment (RCEA) to enhance agent coordination and evidence grounding. This approach improves decision generation for critical tasks like disaster monitoring and damage assessment, moving beyond current RS-VLM limitations.

Key insights

GeoDisaster benchmarks operational geo-intelligence using a multi-agent framework with RCEA for improved disaster reasoning and evidence grounding.

Principles

Operational geo-intelligence requires tool-grounded spatial reasoning.
Multi-agent coordination benefits from explicit execution contracts.
Deterministic consistency checks validate ground-truth answers.

Method

An orchestrated multi-agent framework uses 18 disaster-oriented tools. Role-specialized agents coordinate via explicit execution contracts, aligned by Role-Contract Expectation Alignment (RCEA) through failure-aware supervised fine-tuning and contract-grounded reinforcement learning.

In practice

Develop agents for specific disaster geo-intelligence roles.
Integrate heterogeneous EO/GIS data for comprehensive analysis.
Implement deterministic checks for answer validation.

Topics

GeoDisaster Benchmark
Multi-Agent Systems
Geospatial Intelligence
Disaster Management
Remote Sensing
Role-Contract Expectation Alignment

Best for: Computer Vision Engineer, AI Scientist, Research Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.