pith. machine review for the scientific record. sign in

arxiv: 2603.00113 · v2 · submitted 2026-02-19 · 💻 cs.MA · cs.AI· cs.CE· cs.CY· cs.SI

Recognition: unknown

AI Agents Alone Are Not (Yet) Sufficient for Social Simulation

Authors on Pith no claims yet
classification 💻 cs.MA cs.AIcs.CEcs.CYcs.SI
keywords agentssimulationsocialaloneexplicitmechanismsschedulingsufficient
0
0 comments X
read the original abstract

Recent advances in large language models (LLMs) have spurred growing interest in using LLM-integrated agents for social simulation, often under the implicit assumption that realistic population dynamics will emerge once role-specified agents are placed in a networked multi-agent setting. This position paper argues that LLM-based agents alone are not (yet) sufficient for social simulation. We attribute this over-optimism to a systematic mismatch between what current agent pipelines are typically optimized and validated to produce and what simulation-as-science requires. Concretely, role-playing plausibility does not imply faithful human behavioral validity; collective outcomes are frequently mediated by agent-environment co-dynamics rather than agent-agent messaging alone; and results can be dominated by interaction protocols, scheduling, and initial information priors. To make these underlying mechanisms explicit and auditable, we propose a unified formulation of AI agent-based social simulation as an environment-involved Markov game with explicit exposure and scheduling mechanisms, from which we derive concrete actions for design, evaluation, and interpretation.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. The $\textit{Silicon Society}$ Cookbook: Design Space of LLM-based Social Simulations

    cs.MA 2026-04 unverdicted novelty 5.0

    The base LLM choice dominates simulation outcomes in LLM-based social networks, while other design parameters show either additive or complex interactive effects.