Closed-Loop Bidirectional Prompting for Adversarial Robustness of Vision Language Models

Boci Peng; Boren Hu; Jiaxiang Liu; Liming Zhang; Mingkun Xu; Prayag Tiwari; Xiao Liu; Xiwen Chen; Yusong Wang

arxiv: 2605.25922 · v1 · pith:X2TI5GDCnew · submitted 2026-05-25 · 💻 cs.CV

Closed-Loop Bidirectional Prompting for Adversarial Robustness of Vision Language Models

Xiao Liu , Jiaxiang Liu , Boci Peng , Boren Hu , Yusong Wang , Xiwen Chen , Prayag Tiwari , Liming Zhang

show 1 more author

Mingkun Xu

This is my paper

classification 💻 cs.CV

keywords adversarialbidirectionalcross-modalclosed-loopdefenseslanguagemodelsprompting

0 comments

read the original abstract

Vision Language Models adapt well to downstream tasks but are highly vulnerable to adversarial perturbations that disrupt cross-modal semantic alignment. Existing defenses are largely unidirectional or structural, failing to exploit bidirectional cross-modal complementarity and instance-wise adaptive protection. To overcome the limitations of unidirectional and static defenses in adversarial settings, we propose Closed-Loop Bidirectional Prompting, casting robust adaptation as cross-modal agreement recovery via a dynamic feedback loop on frozen encoders. A Semantic Anchor is introduced as a stable prior to constrain cyclic updates and mitigate perturbation-induced feature corruption. Through anchor-based bootstrapping, textual semantics denoise visual representations, while the refined visuals enable instance-adaptive prompt updating, yielding a rectified and robust consensus. Extensive evaluations across 11 datasets validate state-of-the-art robustness and strong base-to-new generalization, while maintaining a favorable trade-off between computational cost and accuracy.

This paper has not been read by Pith yet.

Closed-Loop Bidirectional Prompting for Adversarial Robustness of Vision Language Models

discussion (0)