Towards Operational Validation of LLM-Agent Social Simulations: A Replicated Study of a Reddit-like Technology Forum

Aleksandar Toma\v{s}evi\'c , Darja Cvetkovi\'c , Sara Major , Slobodan Maleti\'c , Miroslav An{\dj}elkovi\'c , Ana Vrani\'c , Boris Stupovski , Du\v{s}an Vudragovi\'c

show 2 more authors

Aleksandar Bogojevi\'c Marija Mitrovi\'c Dankulov

Authors on Pith no claims yet

classification 💻 cs.CY cs.SIphysics.soc-ph

keywords simulatedcommentssimulationssocialtechnologytoxicityvoatwhile

0 comments

read the original abstract

Validation of LLM-agent social simulations remains underdeveloped, with most studies relying on subjective assessments or single runs. We address this gap by running 30 independent 30-day simulations of a technology forum modeled on Voat's v/technology, using stateless Dolphin Mistral 24B agents on the Y Social platform, and evaluating operational validity across five dimensions: activity patterns, network structure, toxicity, topical coverage, and stylistic convergence. Against 30 matched, non-overlapping 30-day Voat comparison windows, results show overlapping 99% confidence intervals for unique users, root posts, and daily active users, while comments, average thread length, and mean toxicity remain higher in simulation. Both simulated and empirical networks exhibit core-periphery structure, though simulated cores are larger and more diffuse and repeated interactions are less frequent. Topic alignment is near-complete, but toxicity is misallocated across content layers: simulated root posts are substantially more toxic than real submissions, while simulated comments are less toxic than Voat comments. These findings demonstrate that LLM agents in platform-faithful environments can reproduce familiar online regularities, while systematic divergences, particularly those linked to stateless agent design and content-layer calibration, point to concrete directions for future improvement.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Agentic Microphysics: A Manifesto for Generative AI Safety
cs.CY 2026-04 unverdicted novelty 4.0

The authors introduce agentic microphysics and generative safety to link local agent interactions to population-level risks in agentic AI through a causally explicit framework.