Songyang Gao

Identifiers

  • name variant Songyang Gao 0.60 · backfill

Papers (4)

  1. ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning · cs.AI · 2026 · author #4
  2. Graphs of Research: Citation Evolution Graphs as Supervision for Research Idea Generation · cs.CL · 2026 · author #1
  3. EVPO: Explained Variance Policy Optimization for Adaptive Critic Utilization in LLM Post-Training · cs.LG · 2026 · author #7
  4. Secrets of RLHF in Large Language Models Part I: PPO · cs.CL · 2023 · author #3

Mentions

  • 2606.03503 #4 · arxiv_oai · confidence 0.70 Songyang Gao
  • 2307.04964 #3 · arxiv_oai · confidence 0.70 Songyang Gao

Frequent Coauthors