pith. sign in

Jiashu Yao

Identifiers

  • name variant Jiashu Yao 0.60 · backfill

Papers (5)

  1. Learning from Own Solutions: Self-Conditioned Credit Assignment for Reinforcement Learning with Verifiable Rewards cs.LG · 2026 · author #7
  2. PEC-Home: Interpretation of Progressively Elliptical Commands in Smart Homes cs.CL · 2026 · author #5
  3. Benchmarking Living-Screen-Native GUI Agents on Short-Video Platforms cs.CV · 2026 · author #1
  4. Utilizing and Calibrating Hindsight Process Rewards via Reinforcement with Mutual Information Self-Evaluation cs.CL · 2026 · author #1
  5. Policy Split: Incentivizing Dual-Mode Exploration in LLM Reinforcement with Dual-Mode Entropy Regularization cs.CL · 2026 · author #1

Mentions

  • 2606.18810 #7 · arxiv_oai · confidence 0.70 Jiashu Yao
  • 2606.18636 #5 · arxiv_oai · confidence 0.70 Jiashu Yao
  • 2606.04701 #1 · arxiv_oai · confidence 0.70 Jiashu Yao

Frequent Coauthors