AI Persuasive Framing in Collective Dilemmas

Alessia Galdeman; Anders Giovanni Møller; Arianna Pera; Luca Maria Aiello

arXiv: 2606.27951 · v1 · pith:WCPCHPKVnew · submitted 2026-06-26 · 💻 cs.CY · cs.CL · cs.HC · physics.soc-ph

Pith reviewed 2026-06-29 02:14 UTC · model grok-4.3

classification 💻 cs.CY · cs.CL · cs.HC · physics.soc-ph

keywords AI persuasion · collective dilemmas · social value orientation · cooperation · nudges · dual-use risks · group behavior · collective risk games

The pith

Personalized AI framing raises contributions in collective risk games in the short term, but selfish versions reduce them by more and for longer.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests whether AI assistants can nudge players toward cooperation in iterated collective risk games by delivering persuasive messages tailored to each person's Social Value Orientation profile. A sympathetic reader would care because the results show AI can shift group outcomes in either direction, with the negative shifts proving harder to reverse. The work establishes an asymmetry: prosocial framing lifts contributions and success rates at first, yet these gains fade after a few rounds, while reconfigured selfish framing produces larger and more persistent drops, especially when personalized.

Core claim

In small groups playing iterated Collective Risk Games, AI assistants that used persuasive framing matched to each player's Social Value Orientation profile significantly raised individual contributions and group success rates. These cooperative gains lasted only through the first few rounds before fading. When the same AI system was instead set to promote selfish behavior with exculpatory framing, the reductions in contributions and success rates were larger in magnitude and substantially more persistent over time, with personalization amplifying the negative impact.

What carries the argument

AI persuasive framing personalized to each player's Social Value Orientation profile, which tailors messages to encourage or discourage contributions within the iterated Collective Risk Game.
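Personalization of this kind needs a Social Value Orientation score for each player. The sketch below shows one common way to compute it, assuming the SVO Slider Measure of Murphy, Ackermann, and Handgraaf (2011): allocations to self and other are averaged, centred on 50, and converted to an angle. The function names, the category labels, and the use of the slider measure's published cut-offs are assumptions for illustration; the summary above does not state how the paper scores or labels SVO.

```python
import math

# Category boundaries in degrees from the SVO Slider Measure
# (Murphy, Ackermann, and Handgraaf 2011). Assumed here for illustration;
# the paper's own scoring rules are not given in this summary.
ALTRUIST_MIN = 57.15
PROSOCIAL_MIN = 22.45
INDIVIDUALIST_MIN = -12.04


def svo_angle(self_payoffs, other_payoffs):
    """Return the SVO angle in degrees from paired allocations."""
    if len(self_payoffs) != len(other_payoffs) or not self_payoffs:
        raise ValueError("need equally long, non-empty allocation lists")
    mean_self = sum(self_payoffs) / len(self_payoffs)
    mean_other = sum(other_payoffs) / len(other_payoffs)
    # Allocations are centred on 50, the midpoint of the slider scale.
    # atan2 matches arctan(other / self) when mean_self > 50 and also
    # handles the case where the centred self payoff is zero.
    return math.degrees(math.atan2(mean_other - 50.0, mean_self - 50.0))


def svo_category(angle):
    """Map an SVO angle to one of the four slider-measure categories."""
    if angle > ALTRUIST_MIN:
        return "altruistic"
    if angle > PROSOCIAL_MIN:
        return "prosocial"
    if angle > INDIVIDUALIST_MIN:
        return "individualistic"
    return "competitive"
```

A hypothetical assistant could then pick a message template keyed on `svo_category(...)`, which is the role the SVO classification plays in the argument.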

If this is right

AI assistants can temporarily increase cooperation and collective success in repeated group risk settings when messages are personalized.
The same AI capability produces larger and longer-lasting reductions in cooperation when reconfigured to promote selfish behavior.
Personalization strengthens both the short-term cooperative boost and the more enduring antisocial effect.
Prosocial AI effects are limited to initial rounds while antisocial effects endure across more rounds.
AI systems carry dual-use potential for influencing collective action outcomes in either direction.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Safeguards against repurposing cooperative AI nudges for defection may be needed in deployed systems.
The observed asymmetry could be tested by varying message framing intensity or combining it with repeated exposure over longer game horizons.
Real-world collective action problems such as public goods contributions might show similar patterns if AI framing is introduced.
Designs that aim to sustain prosocial effects could draw on the mechanisms that make selfish framing more durable.

Load-bearing premise

Observed differences in contributions can be attributed to the AI framing manipulation rather than unmeasured group dynamics, order effects, or selection into the participant pool, and the Social Value Orientation instrument validly captures the individual differences that matter for personalization.

What would settle it

A follow-up experiment in which contributions and success rates show no reliable difference between the personalized prosocial AI condition and a neutral control, or in which the negative effects of selfish framing fade at the same rate as the positive effects.

Figures

Figures reproduced from arXiv: 2606.27951 by Alessia Galdeman, Anders Giovanni Møller, Arianna Pera, Luca Maria Aiello.

**Figure 1.** Experimental Overview. (A) The participant journey included onboarding, five game rounds, a post-study survey, and a reward page. (B) During each round, participants either submitted their contribution directly (control), reconsidered it after reading a static message, or interacted with an LLM encouraging either cooperative or selfish behavior. Messages and LLM interactions could be personalized based in …

**Figure 2.** Effects of interventions across rounds. Top: Average contributions. Middle: Average group success rates. Bottom: Within-round changes from participants’ initial pledge to their final contribution. Points show condition means and error bars indicate 95% confidence intervals. Positive within-round changes indicate increased contribution, while negative changes indicate reductions. Asterisks reflect significa…

**Figure 3.** Distributions of mean player contribution across all games, disaggregated by treatment.

**Figure 4.** Cooperative AI. Regression coefficients of OLS models predicting outcomes at Round …

**Figure 5.** Selfish AI. Regression coefficients of OLS models predicting outcomes at Round …

**Figure 6.** Cooperative AI. Regression coefficients of OLS models predicting outcomes at Round 1.

**Figure 7.** Selfish AI. Regression coefficients of OLS models predicting outcomes at Round 1.

**Figure 8.** Statistics about the interaction between participants and the AI assistants.

**Figure 9.** Age distribution for all participants.

**Figure 10.** Answer proportions to the question: How did the AI influence your contribution decisions?

Original abstract

AI agents are promising tools that can act as flexible behavioral nudges to enhance human cooperation in addressing large-scale societal problems. However, evidence on whether AI agents can effectively boost cooperation remains mixed. We recruited 1,283 participants to play iterated Collective Risk Games in small groups, testing whether AI assistants could nudge participants toward cooperation. By using persuasive framing personalized to each player's Social Value Orientation profile, the AI interventions significantly increased contributions and group success rates. These cooperative effects were short-lived, however, fading after the first few rounds. Strikingly, when the AI treatments were reconfigured to promote selfish behavior through exculpatory framing, the negative effects on contributions and group success were larger and substantially more persistent, particularly for personalized interventions. This asymmetry between prosocial and antisocial persuasion highlights the dual-use risks of AI systems designed to influence group behavior in collective action settings.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note: private letter to a colleague

The paper reports a short-lived cooperation boost from prosocial AI framing but more persistent drops from selfish framing in collective risk games, based on a large experiment, though methods details are thin.

Two things to know: the paper finds short-lived increases in cooperation from prosocial AI framing but larger and more persistent decreases from antisocial framing in these games, and the result comes from a sizable experiment with over a thousand participants.

The work does well by actually running the iterated collective risk game with real people and testing personalization, which extends prior nudge research into AI agents. The asymmetry result is a legitimate new observation worth noting for anyone thinking about AI in group settings.

The soft spots are the missing details. There is no information on the statistical tests, effect sizes, or how the authors handled the repeated measures and group-level dependencies in the design. Without the regressions or the randomization protocol, confounds from interdependence and order effects in the iterated setup cannot be ruled out. If the full paper lacks those controls, the durability difference might not hold up as a framing effect.

This paper is for behavioral scientists and AI ethics folks working on persuasion and collective action. A reader focused on those areas could find the empirical pattern useful if the methods are sound.

I would recommend sending it for peer review to get the full methods and analysis checked, since the topic matters and the experiment is large enough to be worth the time.

Referee Report

3 major / 2 minor

Summary. The paper reports results from an experiment with 1,283 participants playing iterated Collective Risk Games in small groups. AI assistants deliver persuasive framing (prosocial or exculpatory) that is either personalized to each player's Social Value Orientation (SVO) profile or generic. The central claims are that personalized prosocial framing significantly raises contributions and group success rates (but effects fade after the first few rounds), while exculpatory framing produces larger and more persistent negative effects on the same outcomes, with the asymmetry especially pronounced under personalization. The work concludes by highlighting dual-use risks of AI in collective-action settings.

Significance. If the reported asymmetry survives controls for group interdependence and order effects, the result would document a practically relevant difference in the durability of prosocial versus antisocial AI persuasion and would supply concrete evidence on dual-use concerns for AI systems deployed in collective dilemmas.

major comments (3)

[Results / Statistical Analysis] The manuscript supplies no information on the regression specification used to test treatment effects (e.g., inclusion of lagged group success, round fixed effects, or group-level random effects). In an iterated game where each round's outcome directly shapes subsequent incentives, the absence of these controls leaves open the possibility that the short-lived prosocial effect versus persistent antisocial effect reflects differential carry-over rather than framing asymmetry.
[Methods] No details are provided on randomization procedure, attrition, pre-registration, exact statistical tests, effect sizes, or multiple-comparison corrections. These omissions make it impossible to assess whether the claimed statistical significance and the prosocial–antisocial asymmetry are robust.
[Results] The personalization claim rests on SVO moderating responsiveness to framing, yet the text does not report whether baseline SVO predicts pre-treatment contributions or whether the treatment-by-SVO interaction remains significant under alternative specifications (e.g., continuous vs. categorical SVO, different clustering).
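The first major comment names three ingredients: round fixed effects, lagged group success, and group-level random effects. One way to write a specification with those ingredients is sketched below. It is an illustration of what the referee asks for, not the authors' model, and every symbol is introduced here for that purpose: $y_{igt}$ is the contribution of player $i$ in group $g$ at round $t$, $T_g$ is a treatment indicator, $S_{g,t-1}$ indicates whether the group succeeded in the previous round, $u_g$ is a group random intercept, and $R$ is the number of rounds (five, according to the Figure 1 caption).

```latex
y_{igt} = \beta_0 + \beta_1 T_g
        + \sum_{r=2}^{R} \gamma_r \, \mathbb{1}[t = r]
        + \sum_{r=2}^{R} \delta_r \, T_g \, \mathbb{1}[t = r]
        + \lambda \, S_{g,t-1} + u_g + \varepsilon_{igt},
\qquad u_g \sim \mathcal{N}(0, \sigma_u^2)
```

Under this sketch the treatment effect in round $r$ is $\beta_1 + \delta_r$ (with $\delta_1 = 0$), so a fading prosocial effect would appear as $\delta_r$ offsetting $\beta_1$ in later rounds, while a persistent selfish effect would leave $\beta_1 + \delta_r$ away from zero. Because $S_{g,t-1}$ is undefined at $t = 1$, the first round needs a convention such as setting it to zero or estimating the lag term on rounds 2 to $R$ only.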

minor comments (2)

[Abstract] The abstract states that the interventions 'significantly increased' contributions and group success rates without naming the tests or reporting effect sizes; this should be expanded in the main text for transparency.
[Methods] Notation for the Collective Risk Game payoffs and success threshold should be defined explicitly in the first methods subsection rather than assumed from prior literature.
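To make concrete what the second minor comment asks the authors to define, the sketch below resolves one round of a threshold game in the style of the collective-risk dilemma of Milinski et al. (2008). Everything in it is an assumption for illustration: the names `endowment`, `threshold`, and `loss_probability`, the per-round resolution of risk, and the rule that players lose everything they kept when the loss is triggered. The paper's actual payoffs and threshold are not given in this summary.

```python
import random


def play_round(contributions, endowment, threshold, loss_probability, rng):
    """Resolve one hypothetical collective-risk round.

    Returns (group_success, payoffs), where payoffs lists what each
    player keeps after the round is resolved.
    """
    for c in contributions:
        if not 0 <= c <= endowment:
            raise ValueError("contribution must lie between 0 and the endowment")
    kept = [endowment - c for c in contributions]
    if sum(contributions) >= threshold:
        # Threshold reached: the group succeeds and everyone keeps the rest.
        return True, kept
    # Threshold missed: with probability loss_probability all players
    # lose whatever they kept (assumed loss rule).
    if rng.random() < loss_probability:
        return False, [0 for _ in kept]
    return False, kept
```

For example, `play_round([5, 5, 5], 10, 15, 0.9, random.Random(0))` meets the threshold exactly, so the group succeeds and each player keeps 5 tokens.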

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their constructive comments, which have helped us strengthen the manuscript. We address each major point below and have revised the paper to incorporate additional methodological and statistical details where feasible.

Referee: [Results / Statistical Analysis] The manuscript supplies no information on the regression specification used to test treatment effects (e.g., inclusion of lagged group success, round fixed effects, or group-level random effects). In an iterated game where each round's outcome directly shapes subsequent incentives, the absence of these controls leaves open the possibility that the short-lived prosocial effect versus persistent antisocial effect reflects differential carry-over rather than framing asymmetry.

Authors: We agree that the original manuscript lacked sufficient detail on the regression models. In the revised version, we now explicitly describe the primary specification as a linear mixed-effects model with round fixed effects, a lagged indicator for prior group success, and group-level random intercepts to account for within-group interdependence. We also include robustness checks using alternative lag structures and player-level clustering. These analyses confirm that the asymmetry in effect persistence between prosocial and exculpatory framing remains statistically significant after controlling for carry-over effects. revision: yes
Referee: [Methods] No details are provided on randomization procedure, attrition, pre-registration, exact statistical tests, effect sizes, or multiple-comparison corrections. These omissions make it impossible to assess whether the claimed statistical significance and the prosocial–antisocial asymmetry are robust.

Authors: We have expanded the Methods section to include: block randomization at the group level via the experimental platform; attrition rates (low and balanced across conditions, with analysis of completers vs. dropouts); exact tests (mixed-effects regressions and t-tests with reported p-values); effect sizes (standardized coefficients and Cohen's d); and Bonferroni corrections for multiple comparisons. The study was not pre-registered; we now explicitly note this limitation and its implications for interpretation while reporting all analyses as pre-specified in our internal protocol. revision: partial
Referee: [Results] The personalization claim rests on SVO moderating responsiveness to framing, yet the text does not report whether baseline SVO predicts pre-treatment contributions or whether the treatment-by-SVO interaction remains significant under alternative specifications (e.g., continuous vs. categorical SVO, different clustering).

Authors: We have added new analyses in the Results section demonstrating that baseline SVO does not predict pre-treatment contributions (p > .10 across specifications). The treatment-by-SVO interaction remains significant when SVO is modeled continuously or categorically and under both individual- and group-level clustering. These supplementary results are now reported with full model tables. revision: yes
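The second response lists Cohen's d and Bonferroni corrections among the added statistics. As a reminder of what those two quantities are, here is a minimal sketch using only the Python standard library. The function names and the example numbers are hypothetical, and the sketch uses the textbook pooled-variance form of Cohen's d; the simulated rebuttal does not say which variant the authors used.

```python
import math
from statistics import mean, variance


def cohens_d(group_a, group_b):
    """Cohen's d for two independent samples, using the pooled variance."""
    n_a, n_b = len(group_a), len(group_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each group needs at least two observations")
    pooled_var = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    if pooled_var == 0:
        raise ValueError("pooled variance is zero; d is undefined")
    return (mean(group_a) - mean(group_b)) / math.sqrt(pooled_var)


def bonferroni(p_values, alpha=0.05):
    """Flag which p-values stay significant at alpha / number of tests."""
    if not p_values:
        raise ValueError("need at least one p-value")
    threshold = alpha / len(p_values)
    return [p <= threshold for p in p_values]
```

With made-up contributions `[2, 4, 6]` and `[1, 3, 5]`, both groups have sample variance 4, so `cohens_d` gives (4 - 3) / 2 = 0.5. With three tests at alpha 0.05, `bonferroni` compares each p-value against 0.05 / 3.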

Circularity Check

0 steps flagged

Empirical experiment reports measured outcomes with no derivation chain

The paper describes a participant study in iterated Collective Risk Games, measuring contribution levels and group success under different AI framing conditions. No equations, fitted parameters, or predictions are defined such that any reported effect reduces to an input by construction. Claims rest on observed data differences rather than self-referential definitions or self-citation chains that substitute for independent evidence. Self-citations to prior SVO or game theory work are standard and do not carry the central empirical result.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on the validity of the Social Value Orientation instrument for personalization and on standard assumptions that the experimental contrast isolates the effect of framing; these are domain-standard but not independently demonstrated within the abstract.

axioms (2)

domain assumption: Social Value Orientation profiles validly predict differential responsiveness to prosocial versus exculpatory framing
Invoked to justify personalization of AI messages and to interpret the asymmetry result.
standard math: Standard assumptions of statistical hypothesis testing apply to the reported significance and persistence differences
Required to interpret the statements that effects were 'significantly increased' and 'substantially more persistent'.

pith-pipeline@v0.9.1-grok · 5693 in / 1464 out tokens · 59919 ms · 2026-06-29T02:14:26.942252+00:00 · methodology

Reference graph

Works this paper leans on

62 extracted references

[1] Maria Abou Chakra and Arne Traulsen. 2012. Evolutionary dynamics of strategic behavior in a collective-risk dilemma. (2012)
[2] Marina Agranov. 2024. Communication in stag hunt games: When does it really help? Economics Letters 244 (2024), 111991
[3] Chowdhury Mohammad Sakib Anwar and Konstantinos Georgalos. 2026. Playing Against the Machine: Cooperation, Communication, and Strategy Heterogeneity in Repeated Prisoner’s Dilemma. arXiv preprint arXiv:2603.15852 (2026)
[4] Roland Bénabou, Armin Falk, and Jean Tirole. 2018. Narratives, imperatives, and moral reasoning. Technical Report. National Bureau of Economic Research
[5] Robert D Benford and David A Snow. 2000. Framing processes and social movements: An overview and assessment. Annual review of sociology 26, 2000 (2000), 611–639
[6] Cristina Bicchieri and Azi Lev-On. 2007. Computer-mediated communication and cooperation in social dilemmas: an experimental analysis. politics, philosophy & economics 6, 2 (2007), 139–168
[7] Simon Martin Breum, Daniel Vædele Egdal, Victor Gram Mortensen, Anders Giovanni Møller, and Luca Maria Aiello. 2024. The persuasive power of large language models. In Proceedings of the International AAAI Conference on Web and Social Media, Vol. 18. 152–163
[8] Joana Brito, Regina de Brito Duarte, Henrique C Fonseca, Joana Campos, Filipa Correia, and Ana Paiva. 2025. Assistant Robots with an Agenda foster Uncooperative Behaviors. In 2025 34th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN). IEEE, 1952–1959
[9] Lin Chen, Yunke Zhang, Jie Feng, Haoye Chai, Honglin Zhang, Bingbing Fan, Yibo Ma, Shiyuan Zhang, Nian Li, Tianhui Liu, et al. 2026. AI agent behavioral science. Humanities and Social Sciences Communications (2026)
[10] Xiaojie Chen, Attila Szolnoki, and Matjaž Perc. 2012. Averting group failures in collective-risk social dilemmas. Europhysics Letters 99, 6 (2012), 68003
[11] Thomas H Costello, Gordon Pennycook, and David G Rand. 2024. Durably reducing conspiracy beliefs through dialogues with AI. Science 385, 6714 (2024), eadq1814
[12] Esin Durmus, Liane Lovitt, Alex Tamkin, Stuart Ritchie, Jack Clark, and Deep Ganguli. 2024. Measuring the persuasiveness of language models. Anthropic Blog (2024)
[13] Mike Farjam, Olexandr Nikolaychuk, and Giangiacomo Bravo. 2018. Does risk communication really decrease cooperation in climate change mitigation? Climatic change 149, 2 (2018), 147–158
[14] Ernst Fehr, Urs Fischbacher, and Simon Gächter. 2002. Strong reciprocity, human cooperation, and the enforcement of social norms. Human nature 13, 1 (2002), 1–25
[15] Ernst Fehr and Simon Gächter. 2000. Cooperation and punishment in public goods experiments. American Economic Review 90, 4 (2000), 980–994
[16] Chen Gao, Xiaochong Lan, Nian Li, Yuan Yuan, Jingtao Ding, Zhilun Zhou, Fengli Xu, and Yong Li. 2024. Large language models empowered agent-based modeling and simulation: A survey and perspectives. Humanities and Social Sciences Communications 11, 1 (2024), 1–24
[17] Sergey Gavrilets, Denis Tverskoi, Nianyi Wang, Xiaomin Wang, Juan Ozaita, Boyu Zhang, Angel Sánchez, and Giulia Andrighetto. 2024. Co-evolution of behaviour and beliefs in social dilemmas: estimating material, social, cognitive and cultural determinants. Evolutionary Human Sciences 6 (2024), e50
[18] Arend Hintze and Christoph Adami. 2026. Promoting cooperation in the public goods game using artificial intelligent agents. npj Complexity 3, 1 (2026), 3
[19] Guanxiong Huang and Sai Wang. 2023. Is artificial intelligence more persuasive than humans? A meta-analysis. Journal of Communication 73, 6 (2023), 552–562
[20] Elise Karinshak, Sunny Xun Liu, Joon Sung Park, and Jeffrey T Hancock. 2023. Working with AI to persuade: Examining a large language model’s ability to generate pro-vaccination messages. Proceedings of the ACM on Human-Computer Interaction 7, CSCW1 (2023), 1–29
[21] Maria Kleshnina, Christian Hilbe, Štěpán Šimsa, Krishnendu Chatterjee, and Martin A Nowak. 2023. The effect of environmental information on evolution of cooperation in stochastic games. Nature Communications 14, 1 (2023), 4153
[22] Raphael Koster, Jan Balaguer, Andrea Tacchetti, Ari Weinstein, Tina Zhu, Oliver Hauser, Duncan Williams, Lucy Campbell-Gillingham, Phoebe Thacker, Matthew Botvinick, et al. 2022. Human-centred mechanism design with Democratic AI. Nature Human Behaviour 6, 10 (2022), 1398–1407
[23] John O Ledyard et al. 1994. Public goods: A survey of experimental research. Division of the Humanities and Social Sciences, California Inst. of Technology
[24] Hoesung Lee, Katherine Calvin, Dipak Dasgupta, Gerhard Krinner, Aditi Mukherji, Peter Thorne, Christopher Trisos, José Romero, Paulina Aldunce, Ko Barret, et al. 2023. IPCC, 2023: Climate change 2023: Synthesis report, summary for policymakers. Contribution of working groups i, II and III to the sixth assessment report of the intergovernmental panel on cli...
[25] Sandra C Matz, Jacob D Teeny, Sumer S Vaid, Heinrich Peters, Gabriella M Harari, and Moran Cerf. 2024. The potential of generative AI for personalized persuasion at scale. Scientific Reports 14, 1 (2024), 4692
[26] Theresa Matzinger, Marek Placiński, Adam Gutowski, Mariusz Lewandowski, Przemysław Żywiczyński, and Sławomir Wacewicz. 2024. Inherent linguistic preference outcompetes incidental alignment in cooperative partner choice. Language and Cognition 16, 4 (2024), 1834–1851
[27] Anna Mikhaylovskaya. 2024. Enhancing deliberation with digital democratic innovations. Philosophy & Technology 37, 3 (2024), 3
[28] Manfred Milinski, Ralf D Sommerfeld, Hans-Jürgen Krambeck, Floyd A Reed, and Jochem Marotzke. 2008. The collective-risk social dilemma and the prevention of simulated dangerous climate change. Proceedings of the National Academy of Sciences 105, 7 (2008), 2291–2294
[29] Ryan O Murphy, Kurt A Ackermann, and Michel JJ Handgraaf. 2011. Measuring social value orientation. Judgment and Decision making 6, 8 (2011), 771–781
[30] Martin J Osborne and Ariel Rubinstein. 1994. A course in game theory. MIT press
[31] Stefan Palan and Christian Schitter. 2018. Prolific.ac—A subject pool for online experiments. Journal of behavioral and experimental finance 17 (2018), 22–27
[32] Thomas R Palfrey and Howard Rosenthal. 1984. Participation and the provision of discrete public goods: a strategic analysis. Journal of public Economics 24, 2 (1984), 171–193
[33] María Pereda, Valerio Capraro, and Angel Sánchez. 2019. Group size effects and critical mass in public goods games. Scientific reports 9, 1 (2019), 5503
[34] Zhongheng Qiao, Alex Tabarrok, Robertas Zubrickas, and Timothy N Cason. 2026. Norms in Conflict: Why AI Advisors Fail to Improve Human Coordination. Available at SSRN 6350778 (2026)
[35] Jennifer Renoux, Filipa Correia, Joana Campos, Lucas Morillo-Mendez, Neziha Akalin, Fernando P Santos, and Ana Paiva. 2025. The Effect of Agent-Based Feedback on Prosociality in Social Dilemmas. In Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems. 1755–1763
[36] Francesco Salvi, Manoel Horta Ribeiro, Riccardo Gallotti, and Robert West. 2024. On the conversational persuasiveness of large language models: A randomized controlled trial. (2024)
[37] Chenxinran Shen, Jurgis Karpus, Thomas Kosch, Daniela Fernandes, Beatriz Mello, Robin Welsch, and Steeven Villa. 2025. The Impact of Asymmetric AI Assistance on Decision-Making in Social Dilemmas: A Study on Human Augmentation in Economic Games. In Proceedings of the Augmented Humans International Conference 2025. 187–198
[38] Nicholas Stern, Mattia Romani, Roberta Pierfederici, Manuel Braun, Daniel Barraclough, Shajeeshan Lingeswaran, Elizabeth Weirich-Benet, and Niklas Niemann. 2025. Green and intelligent: the role of AI in the climate transition. npj Climate Action 4, 1 (2025), 56
[39] Aron Szekely, Francesca Lipari, Alberto Antonioni, Mario Paolucci, Angel Sánchez, Luca Tummolini, and Giulia Andrighetto. 2021. Evidence from a long-term experiment that collective risks change social norms and promote cooperation. Nature communications 12, 1 (2021), 5452
[40] Alessandro Tavoni, Astrid Dannenberg, Giorgos Kallis, and Andreas Löschel. 2011. Inequality, communication, and the avoidance of disastrous climate change in a public goods game. Proceedings of the National Academy of Sciences 108, 29 (2011), 11825–11829
[41] Michael Henry Tessler, Michiel A Bakker, Daniel Jarrett, Hannah Sheahan, Martin J Chadwick, Raphael Koster, Georgina Evans, Lucy Campbell-Gillingham, Tantum Collins, David C Parkes, et al. 2024. AI can help humans find common ground in democratic deliberation. Science 386, 6719 (2024), eadq2852
[42] Milena Tsvetkova, Taha Yasseri, Niccolo Pescetelli, and Tobias Werner. 2024. A new sociology of humans and machines. Nature Human Behaviour 8, 10 (2024), 1864–1876
[43] Denis Tverskoi, Andrea Guido, Giulia Andrighetto, Angel Sánchez, and Sergey Gavrilets. 2023. Disentangling material, social, and cognitive determinants of human behavior and beliefs. Humanities and Social Sciences Communications 10, 1 (2023)
[44] Jinhua Zhao, Xianjia Wang, Lei Niu, Rui Ding, and Cuiling Gu. 2023. Cooperation in collective risk dilemmas with endogenous heterogeneity: Nash equilibrium and evolutionary dynamics. Chaos, Solitons & Fractals 175 (2023), 114009

APPENDIX A Experimental Messages
A.1 Prognostic Messages
(1) The way for the grou...
[45] Player made INITIAL PLEDGE: "I will contribute [initial choice] tokens to the pool"
[46] NOW: This conversation (30 second minimum) - RECONSIDERATION WINDOW
[47] NEXT: Player makes FINAL CONTRIBUTION DECISION (can change or keep initial pledge)
[48] "changing"
Your goal: Influence how much they CONTRIBUTE in their final decision
PLAYER PROFILE:
- SVO Classification: [SVO classification]
- Current Round Initial Pledge: [initial choice] tokens (NOT yet locked in - can still change)
- Prior Rounds Final Decisions: [prior decisions] tokens ([number] completed rounds)
- Prior Rounds Average: [averag...
[50] Apply SVO-specific persuasion to guide them toward CONTRIBUTING MORE (giving more to pool)
[51] Create psychological momentum for CONTRIBUTING MORE in current AND future rounds
[52] Be conversational - don't explicitly say "you can still change" but subtly open the door for reconsideration

B.3 SVO-specific Persuasion Strategies for the Cooperative Agent
PERSUASION STRATEGY - COOPERATIVE PLAYER:
- Reinforce group responsibility and moral duty to team
- Appeal to fairness principles and collective success
- Frame higher contributions a...
[53] Engage naturally about how much they initially pledged to contribute and their reasoning
[54] Apply SVO-specific persuasion to guide them toward CONTRIBUTING LESS (giving less to pool = keeping more money)
[55] Create psychological momentum for CONTRIBUTING LESS in current AND future rounds
[56] Be conversational - don't explicitly say "you can still change" but subtly open the door for reconsideration
[57] Frame contributing less as smart, rational, and financially optimal
[58] "don't let others profit from your sacrifice"
Downplay disaster risk and emphasize that contributing less = more personal earnings

B.5 SVO-specific Persuasion Strategies for the Cooperative Agent
PERSUASION STRATEGY - COOPERATIVE PLAYER (SELFISH MODE):
- Challenge their cooperative instincts as naive and exploitable
- Frame others as free-riders who will take advantage of their generosity
- Appeal to...
[62] Most people will act fairly, even when no one is watching. The four statements used a five-point Likert scale: Strongly Disagree, Disagree, Neutral, Agree, and Strongly Agree.

C.2 Social Value Orientation test
Participants received the following instructions: Imagine the following scenario: you are paired with another anonymous person, and both of you will cho...
[63] I believe that other people tend to be more cooperative than I am
[64] I usually trust others when making decisions in group settings
[65] I am willing to make personal sacrifices to help others
[66] Most people will act fairly, even when no one is watching. These items used the same five-point scale: Strongly Disagree, Disagree, Neutral, Agree, and Strongly Agree.

D Demographics and Survey Answer Visualizations
[Figure: Age Distribution; x-axis: Age (years); y-axis: Count]
Fig. 9. Age distribution for all participants.

[1] Maria Abou Chakra and Arne Traulsen. 2012. Evolutionary dynamics of strategic behavior in a collective-risk dilemma. (2012).

[2] Marina Agranov. 2024. Communication in stag hunt games: When does it really help? Economics Letters 244 (2024), 111991.

[3] Chowdhury Mohammad Sakib Anwar and Konstantinos Georgalos. 2026. Playing Against the Machine: Cooperation, Communication, and Strategy Heterogeneity in Repeated Prisoner’s Dilemma. arXiv preprint arXiv:2603.15852 (2026).

[4] Roland Bénabou, Armin Falk, and Jean Tirole. 2018. Narratives, imperatives, and moral reasoning. Technical Report. National Bureau of Economic Research.

[5] Robert D Benford and David A Snow. 2000. Framing processes and social movements: An overview and assessment. Annual review of sociology 26, 2000 (2000), 611–639.

[6] Cristina Bicchieri and Azi Lev-On. 2007. Computer-mediated communication and cooperation in social dilemmas: an experimental analysis. politics, philosophy & economics 6, 2 (2007), 139–168.

[7] Simon Martin Breum, Daniel Vædele Egdal, Victor Gram Mortensen, Anders Giovanni Møller, and Luca Maria Aiello. 2024. The persuasive power of large language models. In Proceedings of the International AAAI Conference on Web and Social Media, Vol. 18. 152–163.

[8] Joana Brito, Regina de Brito Duarte, Henrique C Fonseca, Joana Campos, Filipa Correia, and Ana Paiva. 2025. Assistant Robots with an Agenda foster Uncooperative Behaviors. In 2025 34th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN). IEEE, 1952–1959.

[9] Lin Chen, Yunke Zhang, Jie Feng, Haoye Chai, Honglin Zhang, Bingbing Fan, Yibo Ma, Shiyuan Zhang, Nian Li, Tianhui Liu, et al. 2026. AI agent behavioral science. Humanities and Social Sciences Communications (2026).

[10] Xiaojie Chen, Attila Szolnoki, and Matjaž Perc. 2012. Averting group failures in collective-risk social dilemmas. Europhysics Letters 99, 6 (2012), 68003.

[11] Thomas H Costello, Gordon Pennycook, and David G Rand. 2024. Durably reducing conspiracy beliefs through dialogues with AI. Science 385, 6714 (2024), eadq1814.

[12] Esin Durmus, Liane Lovitt, Alex Tamkin, Stuart Ritchie, Jack Clark, and Deep Ganguli. 2024. Measuring the persuasiveness of language models. Anthropic Blog (2024).

[13] Mike Farjam, Olexandr Nikolaychuk, and Giangiacomo Bravo. 2018. Does risk communication really decrease cooperation in climate change mitigation? Climatic change 149, 2 (2018), 147–158.

[14] Ernst Fehr, Urs Fischbacher, and Simon Gächter. 2002. Strong reciprocity, human cooperation, and the enforcement of social norms. Human nature 13, 1 (2002), 1–25.

[15] Ernst Fehr and Simon Gächter. 2000. Cooperation and punishment in public goods experiments. American Economic Review 90, 4 (2000), 980–994.

[16] Chen Gao, Xiaochong Lan, Nian Li, Yuan Yuan, Jingtao Ding, Zhilun Zhou, Fengli Xu, and Yong Li. 2024. Large language models empowered agent-based modeling and simulation: A survey and perspectives. Humanities and Social Sciences Communications 11, 1 (2024), 1–24.

[17] Sergey Gavrilets, Denis Tverskoi, Nianyi Wang, Xiaomin Wang, Juan Ozaita, Boyu Zhang, Angel Sánchez, and Giulia Andrighetto. 2024. Co-evolution of behaviour and beliefs in social dilemmas: estimating material, social, cognitive and cultural determinants. Evolutionary Human Sciences 6 (2024), e50.

[18] Arend Hintze and Christoph Adami. 2026. Promoting cooperation in the public goods game using artificial intelligent agents. npj Complexity 3, 1 (2026), 3.

[19] Guanxiong Huang and Sai Wang. 2023. Is artificial intelligence more persuasive than humans? A meta-analysis. Journal of Communication 73, 6 (2023), 552–562.

[20] Elise Karinshak, Sunny Xun Liu, Joon Sung Park, and Jeffrey T Hancock. 2023. Working with AI to persuade: Examining a large language model’s ability to generate pro-vaccination messages. Proceedings of the ACM on Human-Computer Interaction 7, CSCW1 (2023), 1–29.

[21] Maria Kleshnina, Christian Hilbe, Štěpán Šimsa, Krishnendu Chatterjee, and Martin A Nowak. 2023. The effect of environmental information on evolution of cooperation in stochastic games. Nature Communications 14, 1 (2023), 4153.

[22] Raphael Koster, Jan Balaguer, Andrea Tacchetti, Ari Weinstein, Tina Zhu, Oliver Hauser, Duncan Williams, Lucy Campbell-Gillingham, Phoebe Thacker, Matthew Botvinick, et al. 2022. Human-centred mechanism design with Democratic AI. Nature Human Behaviour 6, 10 (2022), 1398–1407.

[23] John O Ledyard et al. 1994. Public goods: A survey of experimental research. Division of the Humanities and Social Sciences, California Inst. of Technology.

[24] Hoesung Lee, Katherine Calvin, Dipak Dasgupta, Gerhard Krinner, Aditi Mukherji, Peter Thorne, Christopher Trisos, José Romero, Paulina Aldunce, Ko Barret, et al. 2023. IPCC, 2023: Climate change 2023: Synthesis report, summary for policymakers. Contribution of working groups i, II and III to the sixth assessment report of the intergovernmental panel on cli...

[25] Sandra C Matz, Jacob D Teeny, Sumer S Vaid, Heinrich Peters, Gabriella M Harari, and Moran Cerf. 2024. The potential of generative AI for personalized persuasion at scale. Scientific Reports 14, 1 (2024), 4692.

[26] Theresa Matzinger, Marek Placiński, Adam Gutowski, Mariusz Lewandowski, Przemysław Żywiczyński, and Sławomir Wacewicz. 2024. Inherent linguistic preference outcompetes incidental alignment in cooperative partner choice. Language and Cognition 16, 4 (2024), 1834–1851.

[27] Anna Mikhaylovskaya. 2024. Enhancing deliberation with digital democratic innovations. Philosophy & Technology 37, 3 (2024), 3.

[28] Manfred Milinski, Ralf D Sommerfeld, Hans-Jürgen Krambeck, Floyd A Reed, and Jochem Marotzke. 2008. The collective-risk social dilemma and the prevention of simulated dangerous climate change. Proceedings of the National Academy of Sciences 105, 7 (2008), 2291–2294.

[29] Ryan O Murphy, Kurt A Ackermann, and Michel JJ Handgraaf. 2011. Measuring social value orientation. Judgment and Decision making 6, 8 (2011), 771–781.

[30] Martin J Osborne and Ariel Rubinstein. 1994. A course in game theory. MIT press.

[31] Stefan Palan and Christian Schitter. 2018. Prolific.ac—A subject pool for online experiments. Journal of behavioral and experimental finance 17 (2018), 22–27.

[32] Thomas R Palfrey and Howard Rosenthal. 1984. Participation and the provision of discrete public goods: a strategic analysis. Journal of public Economics 24, 2 (1984), 171–193.

[33] María Pereda, Valerio Capraro, and Angel Sánchez. 2019. Group size effects and critical mass in public goods games. Scientific reports 9, 1 (2019), 5503.

[34] Zhongheng Qiao, Alex Tabarrok, Robertas Zubrickas, and Timothy N Cason. 2026. Norms in Conflict: Why AI Advisors Fail to Improve Human Coordination. Available at SSRN 6350778 (2026).

[35] Jennifer Renoux, Filipa Correia, Joana Campos, Lucas Morillo-Mendez, Neziha Akalin, Fernando P Santos, and Ana Paiva. 2025. The Effect of Agent-Based Feedback on Prosociality in Social Dilemmas. In Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems. 1755–1763.

[36] Francesco Salvi, Manoel Horta Ribeiro, Riccardo Gallotti, and Robert West. 2024. On the conversational persuasiveness of large language models: A randomized controlled trial. (2024).

[37] Chenxinran Shen, Jurgis Karpus, Thomas Kosch, Daniela Fernandes, Beatriz Mello, Robin Welsch, and Steeven Villa. 2025. The Impact of Asymmetric AI Assistance on Decision-Making in Social Dilemmas: A Study on Human Augmentation in Economic Games. In Proceedings of the Augmented Humans International Conference 2025. 187–198.

[38] Nicholas Stern, Mattia Romani, Roberta Pierfederici, Manuel Braun, Daniel Barraclough, Shajeeshan Lingeswaran, Elizabeth Weirich-Benet, and Niklas Niemann. 2025. Green and intelligent: the role of AI in the climate transition. npj Climate Action 4, 1 (2025), 56.

[39] Aron Szekely, Francesca Lipari, Alberto Antonioni, Mario Paolucci, Angel Sánchez, Luca Tummolini, and Giulia Andrighetto. 2021. Evidence from a long-term experiment that collective risks change social norms and promote cooperation. Nature communications 12, 1 (2021), 5452.

[40] Alessandro Tavoni, Astrid Dannenberg, Giorgos Kallis, and Andreas Löschel. 2011. Inequality, communication, and the avoidance of disastrous climate change in a public goods game. Proceedings of the National Academy of Sciences 108, 29 (2011), 11825–11829.

[41] Michael Henry Tessler, Michiel A Bakker, Daniel Jarrett, Hannah Sheahan, Martin J Chadwick, Raphael Koster, Georgina Evans, Lucy Campbell-Gillingham, Tantum Collins, David C Parkes, et al. 2024. AI can help humans find common ground in democratic deliberation. Science 386, 6719 (2024), eadq2852.

[42] Milena Tsvetkova, Taha Yasseri, Niccolo Pescetelli, and Tobias Werner. 2024. A new sociology of humans and machines. Nature Human Behaviour 8, 10 (2024), 1864–1876.

[43] Denis Tverskoi, Andrea Guido, Giulia Andrighetto, Angel Sánchez, and Sergey Gavrilets. 2023. Disentangling material, social, and cognitive determinants of human behavior and beliefs. Humanities and Social Sciences Communications 10, 1 (2023).

[44] Jinhua Zhao, Xianjia Wang, Lei Niu, Rui Ding, and Cuiling Gu. 2023. Cooperation in collective risk dilemmas with endogenous heterogeneity: Nash equilibrium and evolutionary dynamics. Chaos, Solitons & Fractals 175 (2023), 114009.

APPENDIX

A Experimental Messages

A.1 Prognostic Messages

(1) The way for the grou...

[45] [45]

Player made INITIAL PLEDGE: "I will contribute [initial choice] tokens to the pool"

[46] [46]

NOW: This conversation (30 second minimum) - RECONSIDERATION WINDOW

[47] [47]

NEXT: Player makes FINAL CONTRIBUTION DECISION (can change or keep initial pledge)

[48] [48]

changing

Your goal: Influence how much they CONTRIBUTE in their final decision

PLAYER PROFILE:
- SVO Classification: [SVO classification]
- Current Round Initial Pledge: [initial choice] tokens (NOT yet locked in - can still change)
- Prior Rounds Final Decisions: [prior decisions] tokens ([number] completed rounds)
- Prior Rounds Average: [averag...

[49] [50]

Apply SVO-specific persuasion to guide them toward CONTRIBUTING MORE (giving more to pool)

[50] [51]

Create psychological momentum for CONTRIBUTING MORE in current AND future rounds

[51] [52]

Be conversational - don't explicitly say "you can still change" but subtly open the door for reconsideration

B.3 SVO-specific Persuasion Strategies for the Cooperative Agent

PERSUASION STRATEGY - COOPERATIVE PLAYER:
- Reinforce group responsibility and moral duty to team
- Appeal to fairness principles and collective success
- Frame higher contributions a...

[52] [53]

Engage naturally about how much they initially pledged to contribute and their reasoning

[53] [54]

Apply SVO-specific persuasion to guide them toward CONTRIBUTING LESS (giving less to pool = keeping more money)

[54] [55]

Create psychological momentum for CONTRIBUTING LESS in current AND future rounds

[55] [56]

Be conversational - don't explicitly say "you can still change" but subtly open the door for reconsideration

[56] [57]

Frame contributing less as smart, rational, and financially optimal

[57] [58]

don't let others profit from your sacrifice

Downplay disaster risk and emphasize that contributing less = more personal earnings

B.5 SVO-specific Persuasion Strategies for the Cooperative Agent

PERSUASION STRATEGY - COOPERATIVE PLAYER (SELFISH MODE):
- Challenge their cooperative instincts as naive and exploitable
- Frame others as free-riders who will take advantage of their generosity
- Appeal to...

[58] [62]

Most people will act fairly, even when no one is watching.

The four statements used a five-point Likert scale: Strongly Disagree, Disagree, Neutral, Agree, and Strongly Agree.

C.2 Social Value Orientation test

Participants received the following instructions: Imagine the following scenario: you are paired with another anonymous person, and both of you will cho...
