Securing Multi-Agent Systems Against Corruptions via Node Contribution Backpropagation

Chengcan Wu; Meng Sun; Mingqian Xu; Zeming Wei; Zhixin Zhang

arxiv: 2510.19420 · v2 · pith:IGJZWRARnew · submitted 2025-10-22 · 💻 cs.CR · cs.AI· cs.LG· cs.MA· math.OC

Securing Multi-Agent Systems Against Corruptions via Node Contribution Backpropagation

Chengcan Wu , Zhixin Zhang , Mingqian Xu , Zeming Wei , Meng Sun This is my paper

classification 💻 cs.CR cs.AIcs.LGcs.MAmath.OC

keywords agentsmulti-agentcomplexcontributiondefensedefensesdynamicexisting

0 comments

read the original abstract

Multi-Agent Systems (MAS) have become a prevalent paradigm for Large Language Model (LLM) applications. However, the complex multi-agent design in MAS introduces unique trustworthiness concerns: adversarial agents can inject misleading information that propagates contagiously through the system, corrupting benign agents and leading to false outputs. Existing graph-based defenses model agents as nodes and communications as edges, yet are limited to static-graph defenses. In this paper, we propose a dynamic defense paradigm that models MAS communication as a signed directed acyclic graph and computes each agent's contribution to the final decision via backward propagation, enabling accurate identification and isolation of malicious agents to secure multi-agent task collaboration. Experimental results in complex and dynamic MAS environments demonstrate that our method notably outperforms existing MAS defense mechanisms, providing an effective guardrail for trustworthy MAS deployment. Our code is available at https://github.com/ChengcanWu/BPD.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

MESA: Prioritizing Vulnerable Communication Channels for Securing Multi-Agent Systems
cs.CR 2026-06 unverdicted novelty 6.0

MESA ranks MAS communication edges by vulnerability via graph-theoretic metrics and dynamic probes, achieving mean Spearman ρ=+0.60 correlation with empirical per-edge attack success and 3x interception gain when moni...
KYA: A Framework-Agnostic Trust Layer for Autonomous Systems with Verifiable Provenance and Hierarchical Policy Composition
cs.CR 2026-05 unverdicted novelty 5.0

KYA provides a framework-agnostic trust layer using inbound pipelines, policy composition, unified trust scoring, interaction multipliers, and delegation attribution to ensure authorized, conforming, and verifiable ac...
Toward Secure LLM Agents: Threat Surfaces, Attacks, Defenses, and Evaluation
cs.CR 2026-06 unverdicted novelty 3.0

A synthesis of 247 papers on LLM agent security identifies prompt injection and tool hijacking as dominant threats, notes weakly compositional defenses, and argues for trust boundaries and realistic evaluations.