Measuring AI agents’ progress on multi-step cyber attack scenarios

2026 · arXiv 2603.11214

4 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

SecureForge: Finding and Preventing Vulnerabilities in LLM-Generated Code via Prompt Optimization

cs.CR · 2026-05-08 · unverdicted · novelty 6.0

SecureForge audits LLM code for vulnerabilities, builds a synthetic prompt corpus via Markovian sampling, and optimizes system prompts to cut security issues by up to 48% while preserving unit test performance, with zero-shot transfer to real prompts.

Latent Adversarial Detection: Adaptive Probing of LLM Activations for Multi-Turn Attack Detection

cs.CR · 2026-04-30 · unverdicted · novelty 6.0

Adversarial restlessness in LLM activations allows five scalar features to detect multi-turn prompt injections at 93.8% accuracy on synthetic data, with cross-model replication but source-dependent generalization to real-world chats.

Security Considerations for Multi-agent Systems

cs.CR · 2026-03-09 · unverdicted · novelty 6.0

No existing AI security framework covers a majority of the 193 identified multi-agent system threats in any category, with OWASP Agentic Security Initiative achieving the highest overall coverage at 65.3%.

Autonomous Adversary: Red-Teaming in the age of LLM

cs.CR · 2026-05-07 · unverdicted · novelty 5.0

Expert-defined action plans for LLM agents achieve higher task completion in lateral-movement scenarios than fully autonomous or self-scaffolded modes, but failures remain common due to brittle commands and state handling.

citing papers explorer

SecureForge: Finding and Preventing Vulnerabilities in LLM-Generated Code via Prompt Optimization · cs.CR · 2026-05-08 · unverdicted · none · ref 6
Latent Adversarial Detection: Adaptive Probing of LLM Activations for Multi-Turn Attack Detection · cs.CR · 2026-04-30 · unverdicted · none · ref 1
Security Considerations for Multi-agent Systems · cs.CR · 2026-03-09 · unverdicted · none · ref 98
Autonomous Adversary: Red-Teaming in the age of LLM · cs.CR · 2026-05-07 · unverdicted · none · ref 11
