pith. sign in

Zhuoyuan Hao

Identifiers

  • name variant Zhuoyuan Hao 0.60 · backfill

Papers (1)

  1. Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning cs.LG · 2026 · author #2

Mentions

  • 2606.04923 #2 · arxiv_oai · confidence 0.70 Zhuoyuan Hao

Frequent Coauthors