Title resolution pending

doi: 10 · 2024 · arXiv 2024.111545

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Self-Play Reinforcement Learning under Imperfect Information in Big 2

cs.LG · 2026-05-21 · unverdicted · novelty 3.0

PPO with moderate entropy regularization and current-policy self-play outperforms Monte Carlo Q, SARSA, and Q-learning in a controlled self-play framework for the imperfect-information game Big 2.

citing papers explorer

Showing 1 of 1 citing paper.

Self-Play Reinforcement Learning under Imperfect Information in Big 2 cs.LG · 2026-05-21 · unverdicted · none · ref 6
PPO with moderate entropy regularization and current-policy self-play outperforms Monte Carlo Q, SARSA, and Q-learning in a controlled self-play framework for the imperfect-information game Big 2.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer