Multi-agent actor-critic for mixed cooperative-competitive environ- ments

· 2017

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Incentive-Aligned Vehicle-to-Vehicle Energy Trading via Nash-Integrated Multi-Agent Reinforcement Learning

math.OC · 2026-05-21 · unverdicted · novelty 6.0

Nash-MADDPG combines Nash bargaining with MADDPG to coordinate V2V energy trades, yielding 61.6% higher social welfare and 40.1% better Jain fairness than double auctions in 30-day simulations with 6-100 agents.

ARMATA: Auto-Regressive Multi-Agent Task Assignment

cs.MA · 2026-05-05 · unverdicted · novelty 5.0

ARMATA is a new end-to-end autoregressive model with multi-stage decoding that unifies allocation and routing for multi-agent systems and reports up to 20% better solutions than OR-Tools, CPLEX, and LKH-3 in seconds instead of hours.

citing papers explorer

Showing 2 of 2 citing papers.

Incentive-Aligned Vehicle-to-Vehicle Energy Trading via Nash-Integrated Multi-Agent Reinforcement Learning math.OC · 2026-05-21 · unverdicted · none · ref 13
Nash-MADDPG combines Nash bargaining with MADDPG to coordinate V2V energy trades, yielding 61.6% higher social welfare and 40.1% better Jain fairness than double auctions in 30-day simulations with 6-100 agents.
ARMATA: Auto-Regressive Multi-Agent Task Assignment cs.MA · 2026-05-05 · unverdicted · none · ref 31
ARMATA is a new end-to-end autoregressive model with multi-stage decoding that unifies allocation and routing for multi-agent systems and reports up to 20% better solutions than OR-Tools, CPLEX, and LKH-3 in seconds instead of hours.

Multi-agent actor-critic for mixed cooperative-competitive environ- ments

fields

years

verdicts

representative citing papers

citing papers explorer