Shiping Gao

Identifiers

  • name variant Shiping Gao 0.60 · backfill

Papers (2)

  1. Unleashing Implicit Rewards: Prefix-Value Learning for Distribution-Level Optimization · cs.CL · 2026 · author #1
  2. Stabilizing Policy Optimization via Logits Convexity · cs.LG · 2026 · author #4

Mentions

  • 2603.00963 #4 · arxiv_oai · confidence 0.70 · Shiping Gao
  • 2604.13197 #1 · arxiv_oai · confidence 0.70 · Shiping Gao

Frequent Coauthors