Yongchan Kwon, Shang Zhu, Federico Bianchi, Kaitlyn Zhou, and James Zou

Can LLMs simulate personas with reversed performance? A systematic investigation for counterfactual instruction following in math reasoning context · 2025 · arXiv 2504.06460

2 Pith papers cite this work.

representative citing papers

When Built-in Thinking Helps and Hurts: Constraint-Level Error Shifts in Instruction Following

cs.CL · 2026-06-08 · unverdicted · novelty 6.0

Thinking mode in Qwen3 models improves class-level performance on planning constraints but worsens it on precision constraints in IFEval, with 10-20% prompt-level flips and directional consistency in Hunyuan models.

Prompt Governance? On Governing Technologies Governed by Natural Language

cs.CY · 2026-04-29 · unverdicted · novelty 4.0

Literature on system prompts for AI shows fragmented and contradictory claims that complicate policy efforts to use them as reliable governance mechanisms.

citing papers explorer


Prompt Governance? On Governing Technologies Governed by Natural Language

cs.CY · 2026-04-29 · unverdicted · none · ref 175

Literature on system prompts for AI shows fragmented and contradictory claims that complicate policy efforts to use them as reliable governance mechanisms.
