Towards Security-Auditable LLM Agents: A Unified Graph Representation

· 2026 · cs.AI · arXiv 2605.06812

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

LLM-based agentic systems are rapidly evolving to perform complex autonomous tasks through dynamic tool invocation, stateful memory management, and multi-agent collaboration. However, this semantics-driven execution paradigm creates a severe semantic gap between low-level physical events and high-level execution intent, making post-hoc security auditing fundamentally difficult. Existing representation mechanisms, including static SBOMs and runtime logs, provide only fragmented evidence and fail to capture cognitive-state evolution, capability bindings, persistent memory contamination, and cascading risk propagation across interacting agents. To bridge this gap, we propose Agent-BOM, a unified structural representation for agent security auditing. Agent-BOM models an agentic system as a hierarchical attributed directed graph that separates static capability bases, such as models, tools, and long-term memory, from dynamic runtime semantic states, such as goals, reasoning trajectories, and actions. These layers are connected through semantic edges and security attributes, transforming fragmented execution traces into queryable audit paths. Building on Agent-BOM, we develop a graph-query-based paradigm for path-level risk assessment and instantiate it with the OWASP Agentic Top 10. We further implement an auditing plugin in the OpenClaw environment to construct Agent-BOM from live executions. Evaluation on representative real-world agentic attack scenarios shows that Agent-BOM can reconstruct stealthy attack chains, including cross-session memory poisoning and tool misuse, capability supply-chain hijacking and unexpected code execution, multi-agent ecosystem hijacking, and privilege and trust abuse. These results demonstrate that Agent-BOM provides a unified and auditable foundation for root-cause analysis and security adjudication in complex agentic ecosystems.

representative citing papers

When Agents Do Not Stop: Uncovering Infinite Agentic Loops in LLM Agents

cs.SE · 2026-07-02 · unverdicted · novelty 7.0 · 2 refs

Infinite agentic loops are a distinct failure mode in LLM agents arising from unbounded feedback paths, and IAL-Scan detects them via framework-independent static analysis with 91.9% precision on 6,549 repositories.

AgentFlow: Building Agent Dependency Graphs for Static Analysis of Agent Programs

cs.SE · 2026-07-02 · unverdicted · novelty 7.0 · 2 refs

AgentFlow builds a framework-agnostic Agent Dependency Graph from agent program source code to support static analyses such as BOM generation and prompt-to-tool risk detection, evaluated on 5,399 real programs across five frameworks.

citing papers explorer

Showing 2 of 2 citing papers.

When Agents Do Not Stop: Uncovering Infinite Agentic Loops in LLM Agents cs.SE · 2026-07-02 · unverdicted · none · ref 15 · 2 links · internal anchor
Infinite agentic loops are a distinct failure mode in LLM agents arising from unbounded feedback paths, and IAL-Scan detects them via framework-independent static analysis with 91.9% precision on 6,549 repositories.
AgentFlow: Building Agent Dependency Graphs for Static Analysis of Agent Programs cs.SE · 2026-07-02 · unverdicted · none · ref 25 · 2 links · internal anchor
AgentFlow builds a framework-agnostic Agent Dependency Graph from agent program source code to support static analyses such as BOM generation and prompt-to-tool risk detection, evaluated on 5,399 real programs across five frameworks.

Towards Security-Auditable LLM Agents: A Unified Graph Representation

fields

years

verdicts

representative citing papers

citing papers explorer