Collective AI can amplify tiny perturbations into divergent decisions

2026 · cs.AI · arXiv 2603.09127

1 Pith paper cites this work. Polarity classification is still indexing.


abstract

Large language models are increasingly deployed not as single assistants but as committees whose members deliberate and then vote or synthesize a decision. Such systems are often expected to be more robust than individual models. We show that iterative multi-LLM deliberation can instead amplify tiny perturbations into divergent conversational trajectories and different final decisions. In a fully deterministic self-hosted benchmark, exact reruns are identical, yet small meaning-preserving changes to the scenario text still separate over time and often alter the final recommendation. In deployed black-box API systems, nominally identical committee runs likewise remain unstable even at temperature 0, where many users expect near-determinism. Across 12 policy scenarios, these findings indicate that instability in collective AI is not only a consequence of residual platform-side stochasticity, but can arise from sensitivity to nearby initial conditions under repeated interaction itself. Additional deployed experiments show that committee architecture modulates this instability: role structure, model composition, and feedback memory can each alter the degree of divergence. Collective AI therefore faces a stability problem, not only an accuracy problem: deterministic execution alone does not guarantee predictable or auditable deliberative outcomes.
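The distinction the abstract draws between exact reruns and nearby initial conditions can be illustrated with a toy model. The sketch below is not the paper's benchmark and involves no LLMs: it assumes each committee member holds a scalar stance in [0, 1], mixes it with the committee mean each round, and passes the result through a logistic map. The names `deliberate` and `decision`, the coupling of 0.5, and the map parameter 3.9 are all hypothetical choices made only for illustration.

```python
from statistics import mean


def deliberate(stances, rounds=80, coupling=0.5, r=3.9):
    """Run a toy deterministic committee and return the mean stance per round.

    Hypothetical model: each member blends its stance with the committee
    mean, then applies a logistic map. No randomness is involved.
    """
    xs = list(stances)
    history = [mean(xs)]
    for _ in range(rounds):
        avg = mean(xs)
        # Each member moves partway toward the current committee mean.
        mixed = [(1 - coupling) * x + coupling * avg for x in xs]
        # Nonlinear update; values stay inside [0, 1] for r <= 4.
        xs = [r * m * (1 - m) for m in mixed]
        history.append(mean(xs))
    return history


def decision(history):
    """Toy final recommendation: threshold the last mean stance."""
    return "approve" if history[-1] >= 0.5 else "reject"


base = [0.20, 0.45, 0.70]
nudged = [0.20, 0.45, 0.70 + 1e-9]  # tiny change to one member's start

run_a = deliberate(base)
run_b = deliberate(base)    # exact rerun of the same inputs
run_c = deliberate(nudged)  # nearby initial condition

max_gap = max(abs(a - c) for a, c in zip(run_a, run_c))
print("exact rerun identical:", run_a == run_b)
print("largest gap after nudge:", max_gap)
print("decisions:", decision(run_a), decision(run_c))
```

In this toy setting the exact rerun reproduces the same trajectory, while the nudged run drifts away from it over the rounds. Whether the two final decisions differ depends on the chosen parameters, so the sketch shows only the mechanism, not the paper's measured effect.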

representative citing papers

Latent Trajectory Dynamics in Large Language Models: A Manifold Evolution Framework with Empirical Validation

cs.CL · 2025-05-24 · unverdicted · novelty 6.0

DMET models LLM generation as controlled dynamical trajectories on a semantic manifold, with three proxy metrics that predict output quality and support adaptive decoding to lower perplexity.

citing papers explorer


Latent Trajectory Dynamics in Large Language Models: A Manifold Evolution Framework with Empirical Validation

cs.CL · 2025-05-24 · unverdicted · none · ref 14 · internal anchor
