Proposes a conceptual framework defining alignment drift in long-term human-LLM interactions via signal distinctions, feedback loops, three regimes, and boundary conditions for control.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.HC 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Alignment Drift in Long-Term Human-LLM Interaction: A Mechanism-Oriented Framework
Proposes a conceptual framework defining alignment drift in long-term human-LLM interactions via signal distinctions, feedback loops, three regimes, and boundary conditions for control.