Eugene Tarassov

Identifiers

  • name variant Eugene Tarassov 0.60 · backfill

Papers (5)

  1. Offline Regularised Reinforcement Learning for Large Language Models Alignment · cs.LG · 2024 · author #8
  2. Understanding the performance gap between online and offline alignment algorithms · cs.LG · 2024 · author #6
  3. Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments · cs.MA · 2022 · author #25
  4. Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning · cs.AI · 2022 · author #4
  5. Time-series Imputation of Temporally-occluded Multiagent Trajectories · cs.LG · 2021 · author #4

Mentions

  • 2405.19107 #8 · arxiv_oai · confidence 0.70 Eugene Tarassov
  • 2405.08448 #6 · arxiv_oai · confidence 0.70 Eugene Tarassov
  • 2206.15378 #4 · arxiv_oai · confidence 0.70 Eugene Tarassov
  • 2209.10958 #25 · arxiv_oai · confidence 0.70 Eugene Tarassov
  • 2106.04219 #4 · arxiv_oai · confidence 0.70 Eugene Tarassov

Frequent Coauthors