pith. sign in

← back to paper

Review history

arxiv: 2604.26326 · 2 revisions

Addressing Performance Saturation for LLM RL via Precise Entropy Curve Control

  1. 2026-05-12 UNVERDICTED LOW v0.9.0 novelty 6.0
    70905 ms 5545 in 1364 out 2026-05-12T01:52:06.524094+00:00
  2. 2026-05-07 UNVERDICTED LOW v0.9.0 novelty 6.0
    48262 ms 5547 in 1371 out 2026-05-07T13:30:22.414407+00:00