Divya Chaudhary

Identifiers

  • name variant Divya Chaudhary 0.60 · backfill

Papers (2)

  1. Pessimism's Paradox: Conservative Offline Training Amplifies Reward Hacking During Online Adaptation in Reasoning Models · cs.LG · 2026 · author #4
  2. Linear Probes Detect Task Format, Not Reasoning Mode in Language Model Hidden States · cs.CL · 2026 · author #4

Mentions

  • 2606.30627 #4 · arxiv_oai · confidence 0.70 · Divya Chaudhary
  • 2606.02907 #4 · arxiv_oai · confidence 0.70 · Divya Chaudhary

Frequent Coauthors