From Gaze to Guidance: Interpreting and Adapting to Users' Cognitive Needs with Multimodal Gaze-Aware AI Assistants

2026 · cs.HC · arXiv 2604.08062

1 Pith paper cites this work. Polarity classification is still indexing.

abstract

Current LLM assistants are powerful at answering questions, but they have limited access to the behavioral context that reveals when and where a user is struggling. We present a gaze-grounded multimodal LLM assistant that uses egocentric video with gaze overlays to identify likely points of difficulty and target follow-up retrospective assistance. We instantiate this vision in a controlled study (n=36) comparing the gaze-aware AI assistant to a text-only LLM assistant. Compared to a conventional LLM assistant, the gaze-aware assistant was rated as significantly more accurate and personalized in its assessments of users' reading behavior and significantly improved people's ability to recall information. Users spoke significantly fewer words with the gaze-aware assistant, indicating more efficient interactions. Qualitative results underscored both perceived benefits in comprehension and challenges when interpretations of gaze behaviors were inaccurate. Our findings suggest that gaze-aware LLM assistants can reason about cognitive needs to improve cognitive outcomes of users.

representative citing papers

Gaze-Informed Proactive AI Assistance for Children's Picture Exploration

cs.HC · 2026-07-01 · unverdicted · novelty 5.0

Gaze-informed proactive LLM assistance maintained children's attention on picture regions longer and guided exploration to related areas more effectively than random assistance in a within-subject study.

citing papers explorer

Showing 1 of 1 citing paper.

Gaze-Informed Proactive AI Assistance for Children's Picture Exploration

cs.HC · 2026-07-01 · unverdicted · none · ref 11 · internal anchor

Gaze-informed proactive LLM assistance maintained children's attention on picture regions longer and guided exploration to related areas more effectively than random assistance in a within-subject study.
