pith. sign in

Arash Ahmadian

Identifiers

No identifiers captured yet.

Papers (1)

  1. Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs cs.LG · 2024 · author #1

Mentions

No mention provenance yet.

Frequent Coauthors