Semantic Router: On the Feasibility of Hijacking MLLMs via a Single Adversarial Perturbation

Changyue Li; Jiaming He; Jiaying Li; Pinjia He; Youliang Yuan; Zhicong Huang

arxiv: 2511.20002 · v3 · pith:46J7YYE6new · submitted 2025-11-25 · 💻 cs.CV · cs.AI· cs.CR

Semantic Router: On the Feasibility of Hijacking MLLMs via a Single Adversarial Perturbation

Changyue Li , Jiaying Li , Youliang Yuan , Jiaming He , Zhicong Huang , Pinjia He This is my paper

classification 💻 cs.CV cs.AIcs.CR

keywords feasibilityhijackingmllmsperturbationsingleattackroutersemantic

0 comments

read the original abstract

Multimodal Large Language Models (MLLMs) are increasingly deployed in stateless systems, such as autonomous driving and robotics. This paper investigates a novel threat: Semantic-Aware Hijacking. We explore the feasibility of hijacking multiple stateless decisions simultaneously using a single universal perturbation. We introduce the Semantic-Aware Universal Perturbation (SAUP), which acts as a semantic router, "actively" perceiving input semantics and routing them to distinct, attacker-defined targets. To achieve this, we conduct theoretical and empirical analysis on the geometric properties in the latent space. Guided by these insights, we propose the Semantic-Oriented (SORT) optimization strategy and annotate a new dataset with fine-grained semantics to evaluate performance. Extensive experiments on three representative MLLMs demonstrate the fundamental feasibility of this attack, achieving a 66% attack success rate over five targets using a single frame against Qwen.

This paper has not been read by Pith yet.

Semantic Router: On the Feasibility of Hijacking MLLMs via a Single Adversarial Perturbation

discussion (0)