emnlp-main.595/

URL https://aclanthology · 2021 · DOI 10.1609/aaai.v39i25

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

STARE: Step-wise Temporal Alignment and Red-teaming Engine for Multi-modal Toxicity Attack

cs.CR · 2026-05-01 · unverdicted · novelty 8.0

STARE uses step-wise RL to attack multimodal models, achieving a 68% higher attack success rate and revealing that adversarial optimization concentrates conceptual toxicity early and detail toxicity late in the generation trajectory.

citing papers explorer

STARE: Step-wise Temporal Alignment and Red-teaming Engine for Multi-modal Toxicity Attack cs.CR · 2026-05-01 · unverdicted · none · ref 3
