Analyzing Micro-Founded General Equilibrium Models with Many Agents using Deep Reinforcement Learning

Alexander Trott; Michael Curry; Soham Phade; Stephan Zheng; Yu Bai

arxiv: 2201.01163 · v2 · pith:V3D7QBP6new · submitted 2022-01-03 · 💻 cs.GT · cs.AI · cs.LG · econ.GN · q-fin.EC

Michael Curry, Alexander Trott, Soham Phade, Yu Bai, Stephan Zheng

classification 💻 cs.GT cs.AI cs.LG econ.GN q-fin.EC

keywords models, agents, approach, epsilon, general, learning, marl, meta-equilibria

Real economies can be modeled as a sequential imperfect-information game with many heterogeneous agents, such as consumers, firms, and governments. Dynamic general equilibrium (DGE) models are often used for macroeconomic analysis in this setting. However, finding general equilibria is challenging using existing theoretical or computational methods, especially when using microfoundations to model individual agents. Here, we show how to use deep multi-agent reinforcement learning (MARL) to find $\epsilon$-meta-equilibria over agent types in microfounded DGE models. Whereas standard MARL fails to learn non-trivial solutions, our structured learning curricula enable stable convergence to meaningful solutions. Conceptually, our approach is more flexible and does not need unrealistic assumptions, e.g., continuous market clearing, that are commonly used for analytical tractability. Furthermore, our end-to-end GPU implementation enables fast real-time convergence with a large number of RL economic agents. We showcase our approach in open and closed real-business-cycle (RBC) models with 100 worker-consumers, 10 firms, and a social planner who taxes and redistributes. We validate that the learned solutions are $\epsilon$-meta-equilibria through best-response analyses, show that they align with economic intuitions, and show that our approach can learn a spectrum of qualitatively distinct $\epsilon$-meta-equilibria in open RBC models. As such, we show that hardware-accelerated MARL is a promising framework for modeling the complexity of economies based on microfoundations.
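The abstract mentions validating solutions as $\epsilon$-equilibria through best-response analyses but does not spell out the procedure. As a rough illustration of the general idea only, the sketch below measures $\epsilon$ in a toy two-player matrix game: it is the largest payoff gain any player could obtain by deviating unilaterally to an exact best response. The function name `epsilon_of_profile` and the matching-pennies example are assumptions made for illustration; the paper's setting (many agent types, learned best responses in RBC models) is far richer than this.

```python
import numpy as np


def epsilon_of_profile(payoff_a, payoff_b, x, y):
    """Largest unilateral-deviation gain for a mixed profile (x, y).

    payoff_a[i, j] and payoff_b[i, j] are the row and column player's
    payoffs when the row player picks i and the column player picks j.
    Hypothetical helper for a toy game, not the paper's procedure.
    """
    payoff_a = np.asarray(payoff_a, dtype=float)
    payoff_b = np.asarray(payoff_b, dtype=float)
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)

    # Expected payoffs under the current profile.
    current_a = x @ payoff_a @ y
    current_b = x @ payoff_b @ y

    # A best response to a fixed opponent strategy can always be pure,
    # so it suffices to take the maximum over pure actions.
    best_a = np.max(payoff_a @ y)
    best_b = np.max(x @ payoff_b)

    return float(max(best_a - current_a, best_b - current_b))


# Matching pennies: zero-sum, unique equilibrium at uniform mixing.
A = np.array([[1.0, -1.0], [-1.0, 1.0]])
B = -A

eps_uniform = epsilon_of_profile(A, B, [0.5, 0.5], [0.5, 0.5])
eps_pure = epsilon_of_profile(A, B, [1.0, 0.0], [1.0, 0.0])
```

Under these assumptions, a profile is an $\epsilon$-equilibrium when the returned value is at most $\epsilon$. Here the uniform profile yields zero deviation gain, while the pure profile leaves the column player a gain of 2 from switching actions.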

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Hierarchical Multiagent Reinforcement Learning for Multi-Group Tax Game
cs.MA 2026-05 unverdicted novelty 6.0

A bi-level MARL framework with curriculum learning and closed-loop sequential updates learns stable tax policies in multi-group hierarchical games, extending effective game duration by 60.92% and cutting GDP dispariti...

Hierarchical Multiagent Reinforcement Learning for Multi-Group Tax Game
cs.MA 2026-05 unverdicted novelty 6.0

A bilevel MARL framework with curriculum learning and closed-loop sequential updates learns stable tax policies in multi-group taxation simulations, extending effective game duration by 60.92% and reducing GDP dispari...