Redacted by arXiv
Identifiers
- name variant Redacted by arXiv 0.60 · backfill
Papers (2)
- LamPO: A Lambda Style Policy Optimization for Reasoning Language Models cs.CL · 2026 · author #1
- LambdaPO: A Lambda Style Policy Optimization for Reasoning Language Models cs.CL · 2026 · author #1
Mentions
- 2605.21235 #1 · arxiv_oai · confidence 0.70 Redacted by arXiv
- 2605.19416 #1 · arxiv_oai · confidence 0.70 Redacted by arXiv
Frequent Coauthors
- Bowen Deng 2 shared papers
- Jinghan Li 2 shared papers
- Liang Zhao 2 shared papers
- Xinyuan Chen 2 shared papers
- Yipeng Zhou 2 shared papers
- Zhiqian Chen 2 shared papers