pith. sign in

arxiv: 2305.05566 · v1 · pith:TKX4ZDEFnew · submitted 2023-05-09 · 💻 cs.LG · cs.AI· cs.MA

SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning

classification 💻 cs.LG cs.AIcs.MA
keywords smacsmaclitemarlmulti-agentstarcraftalgorithmschallengeenvironment
0
0 comments X
read the original abstract

There is a lack of standard benchmarks for Multi-Agent Reinforcement Learning (MARL) algorithms. The Starcraft Multi-Agent Challenge (SMAC) has been widely used in MARL research, but is built on top of a heavy, closed-source computer game, StarCraft II. Thus, SMAC is computationally expensive and requires knowledge and the use of proprietary tools specific to the game for any meaningful alteration or contribution to the environment. We introduce SMAClite -- a challenge based on SMAC that is both decoupled from Starcraft II and open-source, along with a framework which makes it possible to create new content for SMAClite without any special knowledge. We conduct experiments to show that SMAClite is equivalent to SMAC, by training MARL algorithms on SMAClite and reproducing SMAC results. We then show that SMAClite outperforms SMAC in both runtime speed and memory.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Randomness is sometimes necessary for coordination

    cs.AI 2026-05 conditional novelty 7.0

    Structured per-agent randomness via ranked masking in attention allows symmetric agents to break ties and coordinate, achieving perfect success on symmetric tasks where deterministic policies fail and enabling zero-sh...