Lu Pan
Identifiers
- name variant Lu Pan 0.60 · backfill
Papers (6)
- Rethinking Continual Experience Internalization for Self-Evolving LLM Agents cs.CL · 2026 · author #8
- SIRI: Self-Internalizing Reinforcement Learning with Intrinsic Skills for LLM Agent Training cs.AI · 2026 · author #10
- Skill or Skip? Learning Selective Skill Invocation in Agentic Tasks via Dual-Granularity Preference Learning cs.CL · 2026 · author #8
- From $\log \pi$ to $\pi$: Taming Divergence in Soft Clipping via Bilateral Decoupled Decay of Probability Gradient Weight cs.LG · 2026 · author #8
- How to Allocate, How to Learn? Dynamic Rollout Allocation and Advantage Modulation for Policy Optimization cs.LG · 2026 · author #7
- MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning cs.LG · 2026 · author #8
Mentions
- 2606.04703 #8 · arxiv_oai · confidence 0.70 Lu Pan
- 2606.02355 #10 · arxiv_oai · confidence 0.70 Lu Pan
- 2606.00510 #8 · arxiv_oai · confidence 0.70 Lu Pan
Frequent Coauthors
- Ke Zeng 6 shared papers
- Cong Qin 4 shared papers
- Jiaye Lin 4 shared papers
- Chaowen Hu 3 shared papers
- Xiaoliang Fu 3 shared papers
- Xunliang Cai 3 shared papers
- Yangyi Fang 3 shared papers
- Binbin Zheng 2 shared papers
- Yangen Hu 2 shared papers
- Zekai Shao 2 shared papers
- Chenxing Sun 1 shared papers
- Chishui Chen 1 shared papers
- Fei Huang 1 shared papers
- Haolin Shi 1 shared papers
- Jingwen Chen 1 shared papers
- Junxi Wang 1 shared papers
- Leyi Wei 1 shared papers
- Meng Hsuan Yu 1 shared papers
- Shaodong Zheng 1 shared papers
- Shengda Fan 1 shared papers