Minor perturbations in persona format, instruction framing, and network structure shift cooperation by up to 76 percentage points and consistently shift polarization metrics, showing that LLM social simulations require per-claim robustness audits via the new TRAILS taxonomy.
MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years: 2026 (4)
roles: method (1)
polarities: use method (1)
representative citing papers
A distribution-correction framework for offline LLM reasoning distillation improves accuracy on math benchmarks by adaptively aligning teacher supervision with the student's inference-time distribution.
MemEye benchmark evaluates multimodal memory on visual granularity and evidence synthesis, finding that 13 methods across 4 VLMs struggle with fine details and temporal state changes.
Pneuma-Seeker is an agentic system that reifies underspecified information needs on relational data into explicit relational specifications for iterative refinement, targeted discovery, and provenance-aware execution.
citing papers explorer
- MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory
MemEye benchmark evaluates multimodal memory on visual granularity and evidence synthesis, finding that 13 methods across 4 VLMs struggle with fine details and temporal state changes.