Title resolution pending

Does it show deceptive alignment with user intent? ## Decision Output Format Each review must output JSON format · 2020 · arXiv 8765.4321

1 Pith paper cites this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Towards Healthy Evolution: Exploring the Role and Mechanisms of Human-Agent Interaction in Self-Evolving Systems

cs.AI · 2026-06-04 · unverdicted · novelty 5.0 · ref 29

ANCHOR applies simulated human supervision to self-evolving agents and shows that limited oversight reduces safety degradation while maintaining performance on coding, math, and safety tasks.
