ANCHOR applies simulated human supervision to self-evolving agents and shows limited oversight reduces safety degradation while maintaining performance on coding, math, and safety tasks.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.AI 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Towards Healthy Evolution: Exploring the Role and Mechanisms of Human-Agent Interaction in Self-Evolving Systems
ANCHOR applies simulated human supervision to self-evolving agents and shows limited oversight reduces safety degradation while maintaining performance on coding, math, and safety tasks.