Qingpeng Cai

Identifiers

name variant Qingpeng Cai 0.60 · backfill

Papers (10)

Reinforced Preference Optimization for Reasoning-Augmented Recommendations cs.IR · 2026 · author #9
Position-Aware Drafting for Inference Acceleration in LLM-Based Generative List-Wise Recommendation cs.IR · 2026 · author #5
Phase-Aware Mixture of Experts for Agentic Reinforcement Learning cs.AI · 2026 · author #5
When Importance Sampling Misallocates Credit: Asymmetric Ratios for Outcome-Supervised RL cs.CL · 2025 · author #3
Reinforcement Learning Driven Heuristic Optimization cs.LG · 2019 · author #1
Policy Optimization with Model-based Explorations cs.LG · 2018 · author #2
Deterministic Policy Gradients With General State Transitions cs.LG · 2018 · author #1
A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems cs.AI · 2018 · author #2
Policy Gradients for Contextual Recommendations cs.LG · 2018 · author #2
Reinforcement Mechanism Design for e-commerce cs.MA · 2017 · author #1

Mentions

2605.21967 #9 · arxiv_oai · confidence 0.70 Qingpeng Cai
2602.17038 #5 · arxiv_oai · confidence 0.70 Qingpeng Cai
2510.06062 #3 · arxiv_oai · confidence 0.70 Qingpeng Cai

Frequent Coauthors

Pingzhong Tang 5 shared papers
Ling Pan 3 shared papers
Peng Jiang 3 shared papers
Feiyang Pan 2 shared papers
Kun Gai 2 shared papers
Qing He 2 shared papers
An-xiang Zeng 1 shared papers
Aris Filos-Ratsikas 1 shared papers
Azalia Mirhoseini 1 shared papers
Chenxiao Fan 1 shared papers
Chi Lu 1 shared papers
Chongming Gao 1 shared papers
Chun-Xiang Pan 1 shared papers
Derong Xu 1 shared papers
Fuzheng Zhang 1 shared papers
Fuzhen Zhuang 1 shared papers
George Tucker 1 shared papers
Guorui Zhou 1 shared papers
Haoyan Liu 1 shared papers
Hualin He 1 shared papers