pith. sign in

Jeonghoon Shim

Identifiers

No identifiers captured yet.

Papers (1)

  1. Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States cs.LG · 2026 · author #5

Mentions

No mention provenance yet.

Frequent Coauthors