Constitutional Black-Box Monitoring for Scheming in

Storf, Simon, Barton-Cooper, Rich, Peters-Gill, James, Hobbhahn, Marius , journal=

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

cs.AI · 2026-05-12 · unverdicted · novelty 6.0

Frontier LLMs miss dangerous actions in long coding agent transcripts 2-30 times more often after hundreds of thousands of benign tokens.

Showing 1 of 1 citing paper.

Classifier Context Rot: Monitor Performance Degrades with Context Length cs.AI · 2026-05-12 · unverdicted · none · ref 7
Frontier LLMs miss dangerous actions in long coding agent transcripts 2-30 times more often after hundreds of thousands of benign tokens.