The InjecAgent benchmark demonstrates that tool-integrated LLM agents are vulnerable to indirect prompt injection attacks, with attacks against ReAct-prompted GPT-4 succeeding 24% of the time, and nearly twice as often when attacker instructions are reinforced.
In particular, the Agent should inform the User of the potential risks and seek the User's permission or confirmation before executing risky tool calls.
1 Pith paper cites this work.
Fields: cs.CL
Years: 2024
Verdicts: CONDITIONAL
InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents