Towards Security-Auditable LLM Agents: A Unified Graph Representation
Summary
Agent-BOM is a proposed unified structural representation designed to enhance security auditing for LLM-based agentic systems, which often struggle with a semantic gap between low-level events and high-level execution intent. This gap makes post-hoc security auditing difficult, as existing mechanisms like SBOMs and runtime logs offer fragmented evidence and fail to capture critical aspects such as cognitive-state evolution, capability bindings, memory contamination, and cascading risk. Agent-BOM models an agentic system as a hierarchical attributed directed graph, distinguishing between static capability bases (models, tools, long-term memory) and dynamic runtime semantic states (goals, reasoning trajectories, actions). These layers are interconnected via semantic edges and security attributes, transforming execution traces into queryable audit paths. The system facilitates graph-query-based risk assessment, instantiated with the OWASP Agentic Top 10, and has been implemented as an auditing plugin within the OpenClaw environment. Evaluations against real-world agentic attack scenarios, including cross-session memory poisoning and multi-agent ecosystem hijacking, demonstrate Agent-BOM's ability to reconstruct stealthy attack chains.
Key takeaway
For security architects and engineering leaders deploying LLM-based agentic systems, understanding Agent-BOM is crucial for establishing robust auditing capabilities. Your teams should consider integrating such a unified graph representation to overcome the semantic gap in current security tools, enabling more effective root-cause analysis and proactive identification of complex attack vectors like memory poisoning or tool misuse. This approach enhances the auditable foundation for securing intricate agentic ecosystems.
Key insights
Agent-BOM provides a unified graph representation for auditing LLM agents, bridging the semantic gap in security analysis.
Principles
- Separate static capabilities from dynamic states.
- Model agent systems as hierarchical attributed graphs.
- Transform traces into queryable audit paths.
Method
Agent-BOM constructs a hierarchical attributed directed graph from agent executions, separating static and dynamic components. It then uses graph queries for path-level risk assessment, aligning with frameworks like OWASP Agentic Top 10.
In practice
- Reconstruct stealthy attack chains.
- Identify cross-session memory poisoning.
- Detect capability supply-chain hijacking.
Topics
- LLM Agents
- Security Auditing
- Agent-BOM
- Graph Representation
- OWASP Agentic Top 10
Best for: Research Scientist, CTO, VP of Engineering/Data, AI Scientist, AI Security Engineer, AI Architect
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.