Multiagent Cooperation and Competition with Deep Reinforcement Learning

· 2015 · cs.AI · arXiv 1511.08779

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Multiagent systems appear in most social, economical, and political situations. In the present work we extend the Deep Q-Learning Network architecture proposed by Google DeepMind to multiagent environments and investigate how two agents controlled by independent Deep Q-Networks interact in the classic videogame Pong. By manipulating the classical rewarding scheme of Pong we demonstrate how competitive and collaborative behaviors emerge. Competitive agents learn to play and score efficiently. Agents trained under collaborative rewarding schemes find an optimal strategy to keep the ball in the game as long as possible. We also describe the progression from competitive to collaborative behavior. The present work demonstrates that Deep Q-Networks can become a practical tool for studying the decentralized learning of multiagent systems living in highly complex environments.

representative citing papers

Distributed Hierarchical Temporal Memory with Shared Associative Memory for Cross-Entity Preemptive Warning

cs.NE · 2026-06-30 · unverdicted · novelty 5.0

D-HTM adds a shared associative memory to hierarchical temporal memory so that precursor signatures learned on one entity can trigger preemptive warnings on related entities, yielding an average 8.1-sample lead time on tested datasets.

citing papers explorer

Showing 1 of 1 citing paper.

Distributed Hierarchical Temporal Memory with Shared Associative Memory for Cross-Entity Preemptive Warning cs.NE · 2026-06-30 · unverdicted · none · ref 20 · internal anchor
D-HTM adds a shared associative memory to hierarchical temporal memory so that precursor signatures learned on one entity can trigger preemptive warnings on related entities, yielding an average 8.1-sample lead time on tested datasets.

Multiagent Cooperation and Competition with Deep Reinforcement Learning

fields

years

verdicts

representative citing papers

citing papers explorer