DeXposure-Claw: An Agentic System for DeFi Risk Supervision

· Source: cs.CL updates on arXiv.org · Field: Finance & Economics — FinTech & Digital Financial Services, Capital Markets & Investment Management, Financial Risk Management · Depth: Expert, extended

Summary

DeXposure-Claw is an agentic system designed for decentralized finance (DeFi) risk supervision, addressing the limitations of general-purpose LLM agents in handling fast-moving, networked credit risks. The system routes LLM decisions through structured evidence, beginning with DeXposure-FM, a graph time-series foundation model that forecasts future exposure networks. These forecasts are then processed by deterministic monitors and stress scenarios, generating typed alerts, attribution signals, and scenario evidence. Data-health and confidence gates critically constrain escalation before DeXposure-Claw emits auditable supervisory tickets with rationales. Evaluated using DeXposure-Bench, a six-axis harness with a regulator-aligned absolute-loss ground truth, the system demonstrated improved ticket F1 from 0.0076 to 0.0288 with Claude Sonnet 4.6. However, the LLM component misfires on approximately 37-44% of interventions, underscoring that safety relies on the data-health and confidence gates and human review, rather than the decision model alone.

Key takeaway

For AI Security Engineers or ML Engineers deploying LLM agents in high-stakes DeFi supervision, recognize that direct LLM reasoning over raw on-chain data is unsafe due to over-intervention. You should prioritize integrating robust forecast-grounded evidence and explicit data-health and confidence gates into your system design. This approach improves auditability and coverage, but be aware that LLM over-reading persists, making human-in-the-loop review and strong gating mechanisms, not just model choice, paramount for safety.

Key insights

High-stakes DeFi supervision requires LLM agents to process forecast-grounded, structured evidence, constrained by safety gates.

Principles

Method

Forecast DeFi exposure graphs with a graph time-series FM, generate alerts and stress scenarios, then use an LLM to draft tickets, which are released only after passing data-health and confidence gates.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Security Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.