pith. sign in

Ryotaro Kawata

Identifiers

  • name variant Ryotaro Kawata 0.60 · backfill

Papers (1)

  1. How Neural Reward Models Learn Features for Policy Optimization: A Single-Index Analysis stat.ML · 2026 · author #2

Mentions

  • 2605.24749 #2 · arxiv_oai · confidence 0.70 Ryotaro Kawata

Frequent Coauthors