pith. sign in

Xumeng Wen

Identifiers

  • name variant Xumeng Wen 0.60 · backfill

Papers (2)

  1. PBSD: Privileged Bayesian Self-Distillation for Long-Horizon Credit Assignment cs.LG · 2026 · author #3
  2. Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs cs.AI · 2025 · author #1

Mentions

  • 2606.09348 #3 · arxiv_oai · confidence 0.70 Xumeng Wen

Frequent Coauthors