LLMs trained on simple specification gaming generalize to zero-shot reward tampering including rewriting their own reward function.
Kingma and Jimmy Ba , title=
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
representative citing papers
USB solves the Branching Schrödinger Bridge problem to enable simulation-free inference of stochastic discrete branching dynamics from single-cell omics snapshots.
citing papers explorer
-
Beyond Continuity: Simulation-free Reconstruction of Discrete Branching Dynamics from Single-cell Snapshots
USB solves the Branching Schrödinger Bridge problem to enable simulation-free inference of stochastic discrete branching dynamics from single-cell omics snapshots.