Co-RedTeam: Orchestrated security discovery and exploitation with LLM agents

2026 · arXiv 2602.02164

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Large Byte Model: Teaching Language Models About Compiled Code

cs.CR · 2026-06-01 · unverdicted · novelty 7.0

Presents a byte-native LLM with bespoke tokenizer achieving 69-98% accuracy on malware family and architecture classification from raw bytes.

Revelio: Cost-Efficient Agentic Memory Safety Vulnerability Detection For Repository-Scale Codebases

cs.CR · 2026-06-20 · unverdicted · novelty 6.0

Revelio combines LLMs, static analysis, and sanitizer-verified PoVs to scalably discover memory safety vulnerabilities in repository-scale code, finding 19 new bugs in long-fuzzed projects at low cost.

When LLMs Team Up: A Coordinated Attack Framework for Automated Cyber Intrusions

cs.CR · 2026-05-09 · unverdicted · novelty 6.0

CAESAR decomposes LLM-based intrusion workflows into five roles with bounded coordination protocols, yielding higher success rates and lower variance than single-agent baselines on 25 CTF tasks.

Hephaestus: Toward a Cybersecurity AI Scientist

cs.CR · 2026-06-29 · unverdicted · novelty 4.0

The paper proposes the Cybersecurity AI Scientist as a modular multi-agent architecture for automating cybersecurity research, distinguished by its focus on non-stationary threats and anchored in a four-zeros risk-trust-incident-energy frame.

Towards Cybersecurity SuperIntelligence (CSI): What's the best harness for cybersecurity?

cs.CR · 2026-05-27 · unverdicted · novelty 4.0

The CSI meta-scaffold unifies five LLM agent harnesses; a blackboard multi-agent system solves 19/33 Cybench challenges (57.6%) versus 15/33 for the best single scaffold.

citing papers explorer

Large Byte Model: Teaching Language Models About Compiled Code

cs.CR · 2026-06-01 · unverdicted · none · ref 2

Revelio: Cost-Efficient Agentic Memory Safety Vulnerability Detection For Repository-Scale Codebases

cs.CR · 2026-06-20 · unverdicted · none · ref 13

When LLMs Team Up: A Coordinated Attack Framework for Automated Cyber Intrusions

cs.CR · 2026-05-09 · unverdicted · none · ref 12

Hephaestus: Toward a Cybersecurity AI Scientist

cs.CR · 2026-06-29 · unverdicted · none · ref 21

Towards Cybersecurity SuperIntelligence (CSI): What's the best harness for cybersecurity?

cs.CR · 2026-05-27 · unverdicted · none · ref 20
