Zhen Shu

Identifiers

  • name variant Zhen Shu 0.60 · backfill

Papers (1)

  1. ExTra: Exploratory Trajectory Optimization for Language Model Reinforcement Learning · cs.LG · 2026 · author #3

Mentions

  • 2606.24994 #3 · arxiv_oai · confidence 0.70 · Zhen Shu

Frequent Coauthors