pith. sign in

Gentel-safe: A uni- fied benchmark and shielding framework for defend- ing against prompt injection attacks

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

cs.CR 1 cs.LG 1

years

2026 1 2025 1

verdicts

UNVERDICTED 2

clear filters

representative citing papers

Progent: Securing AI Agents with Privilege Control

cs.CR · 2025-04-16 · unverdicted · novelty 6.0

Progent introduces a privilege-control framework for AI agents that uses LLM-generated symbolic rules over tools, SMT-solver-enforced monotonic updates, and deterministic checks to reduce attack success rates on AgentDojo and ASB benchmarks.

citing papers explorer

Showing 1 of 1 citing paper after filters.