arXiv preprint arXiv:2002.03939

Qatten: A general framework for cooperative multiagent reinforcement learning · 2020 · arXiv 2002.03939

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Randomness is sometimes necessary for coordination

cs.AI · 2026-05-07 · conditional · novelty 7.0

Structured per-agent randomness via ranked masking in attention allows symmetric agents to break ties and coordinate, achieving perfect success on symmetric tasks where deterministic policies fail and enabling zero-shot transfer across team sizes.

Phi-Actor-Critic: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria

cs.MA · 2026-06-09 · unverdicted · novelty 5.0

Phi-Actor-Critic is a new method that steers multi-agent reinforcement learning toward Pareto-efficient correlated equilibria using regret minimization and Lagrangian selection.

