pith. sign in

System prompt poisoning: Persistent attacks on large language models beyond user injection

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

years

2026 2 2025 1

verdicts

UNVERDICTED 3

roles

background 1

polarities

background 1

clear filters

representative citing papers

The Surface You Test Is Not the Surface That Breaks

cs.CR · 2026-05-28 · unverdicted · novelty 6.0

Prompt injection vulnerability in tool-augmented LLMs is a model-surface interaction rather than a fixed channel property; the same payload inverts success rates across models, and adaptive attack rate exceeds single-surface baselines by 9.1 pp on average.

citing papers explorer

Showing 2 of 2 citing papers after filters.

  • The Surface You Test Is Not the Surface That Breaks cs.CR · 2026-05-28 · unverdicted · none · ref 13

    Prompt injection vulnerability in tool-augmented LLMs is a model-surface interaction rather than a fixed channel property; the same payload inverts success rates across models, and adaptive attack rate exceeds single-surface baselines by 9.1 pp on average.

  • Prompt Governance? On Governing Technologies Governed by Natural Language cs.CY · 2026-04-29 · unverdicted · none · ref 196

    Literature on system prompts for AI shows fragmented and contradictory claims that complicate policy efforts to use them as reliable governance mechanisms.