u ttler, Mike Lewis, Wen-tau Yih, Tim Rockt\

Douwe Kiela · 2020

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents

cs.CR · 2024-10-03 · unverdicted · novelty 7.0

ASB is a new benchmark that tests 10 prompt injection attacks, memory poisoning, a novel Plan-of-Thought backdoor attack, and 11 defenses on LLM agents across 13 models, finding attack success rates up to 84.3% and limited defense effectiveness.

Reinforced Graph of Thoughts: RL-Driven Adaptive Prompting for LLMs

cs.LG · 2026-05-21 · unverdicted · novelty 5.0

RGoT uses RL to adaptively generate task-specific graphs of operations for GoT-style LLM prompting from a human-provided set, with results suggesting feasibility under constraints.

citing papers explorer

Showing 2 of 2 citing papers.

Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents cs.CR · 2024-10-03 · unverdicted · none · ref 117
ASB is a new benchmark that tests 10 prompt injection attacks, memory poisoning, a novel Plan-of-Thought backdoor attack, and 11 defenses on LLM agents across 13 models, finding attack success rates up to 84.3% and limited defense effectiveness.
Reinforced Graph of Thoughts: RL-Driven Adaptive Prompting for LLMs cs.LG · 2026-05-21 · unverdicted · none · ref 29
RGoT uses RL to adaptively generate task-specific graphs of operations for GoT-style LLM prompting from a human-provided set, with results suggesting feasibility under constraints.

u ttler, Mike Lewis, Wen-tau Yih, Tim Rockt\

fields

years

verdicts

representative citing papers

citing papers explorer