Tianpei Yang

Identifiers

  • name variant Tianpei Yang 0.60 · backfill

Papers (5)

  1. Tool-Aware Optimization with Entropy Guidance for Efficient Agentic Reinforcement Learning cs.LG · 2026 · author #5
  2. Thinking-Based Non-Thinking: Solving the Reward Hacking Problem in Training Hybrid Reasoning Models via Reinforcement Learning cs.AI · 2026 · author #4
  3. Learning Shaping Strategies in Human-in-the-loop Interactive Reinforcement Learning cs.HC · 2018 · author #2
  4. Towards Efficient Detection and Optimal Response against Sophisticated Opponents cs.MA · 2018 · author #1
  5. Hierarchical Heuristic Learning towards Efficient Norm Emergence cs.MA · 2018 · author #1

Mentions

  • 2601.04805 #4 · arxiv_oai · confidence 0.70 Tianpei Yang
  • 2606.03762 #5 · arxiv_oai · confidence 0.70 Tianpei Yang

Frequent Coauthors