pith. sign in

Advances in Neural Information Processing Systems , volume=

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

years

2026 4

verdicts

UNVERDICTED 4

roles

background 1

polarities

background 1

clear filters

representative citing papers

Staleness-Learning Rate Scaling Laws for Asynchronous RLHF

cs.LG · 2026-07-01 · unverdicted · novelty 5.0

Stale rollouts introduce O(S * eta) surrogate-gradient bias in async GRPO, yielding stability condition eta << min{R_batch / (S * G_upd), R_crit / (T * G_upd)} under smoothness assumptions.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.