Review history

arxiv: 2604.17892 · 2 revisions

LEPO: Latent Reasoning Policy Optimization for Large Language Models

  1. 2026-05-12 UNVERDICTED LOW v0.9.0 novelty 5.0
    75976 ms 5478 in 1041 out 2026-05-12T03:54:33.433701+00:00
  2. 2026-05-10 UNVERDICTED LOW v0.9.0 novelty 6.0
    40851 ms 5478 in 1054 out 2026-05-10T05:42:41.780170+00:00