arXiv preprint arXiv:2002.03939 , year=

URLhttps://arxiv · 2002 · arXiv 2002.03939

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Randomness is sometimes necessary for coordination

cs.AI · 2026-05-07 · conditional · novelty 7.0

Structured per-agent randomness via ranked masking in attention allows symmetric agents to break ties and coordinate, achieving perfect success on symmetric tasks where deterministic policies fail and enabling zero-shot transfer across team sizes.

Stagnant Neuron: Towards Understanding the Plasticity Loss in Multi-Agent Reinforcement Learning Value Factorization Methods

cs.LG · 2026-06-24 · unverdicted · novelty 6.0

KNIFE targets stagnant neurons in MARL value factorization by replacing them with a composite of frozen, re-initialized, and compensating units to restore plasticity while preserving cooperation knowledge.

Phi-Actor-Critic: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria

cs.MA · 2026-06-09 · unverdicted · novelty 5.0

Phi-Actor-Critic is a new method that steers multi-agent reinforcement learning toward Pareto-efficient correlated equilibria using regret minimization and Lagrangian selection.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Randomness is sometimes necessary for coordination cs.AI · 2026-05-07 · conditional · none · ref 30
Structured per-agent randomness via ranked masking in attention allows symmetric agents to break ties and coordinate, achieving perfect success on symmetric tasks where deterministic policies fail and enabling zero-shot transfer across team sizes.

arXiv preprint arXiv:2002.03939 , year=

fields

years

verdicts

representative citing papers

citing papers explorer