Title resolution pending

A review of cooperative multi-agent deep reinforcement learning · 2023 · DOI 10.1007/s10489-022-04105-y

2 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

ARMS: Automatic Reward Shaping for Sparse-Reward Multi-Agent Reinforcement Learning

cs.MA · 2026-05-22 · unverdicted · novelty 7.0

ARMS is an automatic reward-shaping framework for sparse-reward MARL that uses trajectory ranking and conditional best-response reasoning to preserve Nash equilibria while improving sampling efficiency in pathfinding tasks.

Robust Instruction Compliance in Cooperative Multi-Agent Reinforcement Learning

cs.AI · 2026-05-12 · unverdicted · novelty 6.0

MAVIC corrects Bellman backups at instruction boundaries by adjusting the incoming objective and restoring continuation value, enabling consistent estimation under stochastic instruction switching in cooperative MARL.

citing papers explorer

Showing 2 of 2 citing papers.

ARMS: Automatic Reward Shaping for Sparse-Reward Multi-Agent Reinforcement Learning · cs.MA · 2026-05-22 · unverdicted · none · ref 7
Robust Instruction Compliance in Cooperative Multi-Agent Reinforcement Learning · cs.AI · 2026-05-12 · unverdicted · none · ref 7
