pith. machine review for the scientific record.

Hejia Geng

Identifiers

No identifiers captured yet.

Papers (4)

  1. SUDP: Secret-Use Delegation Protocol for Agentic Systems · cs.CR · 2026 · author #2
  2. PAPO: Stabilizing Rubric Integration Training via Decoupled Advantage Normalization · cs.AI · 2026 · author #5
  3. Scaling Behaviors of LLM Reinforcement Learning Post-Training: An Empirical Study in Mathematical Reasoning · cs.LG · 2025 · author #2
  4. The Landscape of Agentic Reinforcement Learning for LLMs: A Survey · cs.AI · 2025 · author #2

Mentions

No mention provenance yet.

Frequent Coauthors