Qingpeng Cai
Identifiers
- name variant Qingpeng Cai 0.60 · backfill
Papers (10)
- Reinforced Preference Optimization for Reasoning-Augmented Recommendations cs.IR · 2026 · author #9
- Position-Aware Drafting for Inference Acceleration in LLM-Based Generative List-Wise Recommendation cs.IR · 2026 · author #5
- Phase-Aware Mixture of Experts for Agentic Reinforcement Learning cs.AI · 2026 · author #5
- When Importance Sampling Misallocates Credit: Asymmetric Ratios for Outcome-Supervised RL cs.CL · 2025 · author #3
- Reinforcement Learning Driven Heuristic Optimization cs.LG · 2019 · author #1
- Policy Optimization with Model-based Explorations cs.LG · 2018 · author #2
- Deterministic Policy Gradients With General State Transitions cs.LG · 2018 · author #1
- A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems cs.AI · 2018 · author #2
- Policy Gradients for Contextual Recommendations cs.LG · 2018 · author #2
- Reinforcement Mechanism Design for e-commerce cs.MA · 2017 · author #1
Mentions
- 2605.21967 #9 · arxiv_oai · confidence 0.70 Qingpeng Cai
- 2602.17038 #5 · arxiv_oai · confidence 0.70 Qingpeng Cai
- 2510.06062 #3 · arxiv_oai · confidence 0.70 Qingpeng Cai
Frequent Coauthors
- Pingzhong Tang 5 shared papers
- Ling Pan 3 shared papers
- Peng Jiang 3 shared papers
- Feiyang Pan 2 shared papers
- Kun Gai 2 shared papers
- Qing He 2 shared papers
- An-xiang Zeng 1 shared papers
- Aris Filos-Ratsikas 1 shared papers
- Azalia Mirhoseini 1 shared papers
- Chenxiao Fan 1 shared papers
- Chi Lu 1 shared papers
- Chongming Gao 1 shared papers
- Chun-Xiang Pan 1 shared papers
- Derong Xu 1 shared papers
- Fuzheng Zhang 1 shared papers
- Fuzhen Zhuang 1 shared papers
- George Tucker 1 shared papers
- Guorui Zhou 1 shared papers
- Haoyan Liu 1 shared papers
- Hualin He 1 shared papers