Arash Ahmadian
Identifiers
No identifiers captured yet.
Papers (1)
- Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs cs.LG · 2024 · author #1
Mentions
No mention provenance yet.
Frequent Coauthors
- Ahmet \"Ust\"un 1 shared papers
- Chris Cremer 1 shared papers
- Julia Kreutzer 1 shared papers
- Marzieh Fadaee 1 shared papers
- Matthias Gall\'e 1 shared papers
- Olivier Pietquin 1 shared papers
- Sara Hooker 1 shared papers