pith. sign in

Jingyi Song

Identifiers

  • name variant Jingyi Song 0.60 · backfill

Papers (1)

  1. DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning cs.CL · 2026 · author #2

Mentions

  • 2605.25604 #2 · arxiv_oai · confidence 0.70 Jingyi Song

Frequent Coauthors