VisionClaw: Always-On AI Agents through Smart Glasses

· 2026 · cs.HC · arXiv 2604.03486

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

We present VisionClaw, an always-on wearable AI agent that integrates live egocentric perception with agentic task execution. Running on Meta Ray-Ban smart glasses, VisionClaw continuously perceives real-world context and enables in-situ, speech-driven action initiation and delegation via OpenClaw AI agents. Therefore, users can directly execute tasks through the smart glasses, such as adding real-world objects to an Amazon cart, generating notes from physical documents, receiving meeting briefings on the go, creating events from posters, or controlling IoT devices. We evaluate VisionClaw through a controlled laboratory study (N=12) and a longitudinal deployment study (N=5). Results show that integrating perception and execution enables faster task completion and reduces interaction overhead compared to non-always-on and non-agent baselines. Beyond performance gains, deployment findings reveal a shift in interaction: tasks are initiated opportunistically during ongoing activities, and execution is increasingly delegated rather than manually controlled. These results suggest a new paradigm for wearable AI agents, where perception and action are continuously coupled to support situated, hands-free interaction.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

SpeakerLLM: A Speaker-Specialized Audio-LLM for Speaker Understanding and Verification Reasoning

cs.SD · 2026-05-14 · unverdicted · novelty 6.0

SpeakerLLM unifies speaker profiling, recording-condition understanding, and structured verification reasoning in an audio-LLM via a hierarchical tokenizer and decision traces.

Position: Life-Logging Video Streams Make the Privacy-Utility Trade-off Inevitable

cs.CV · 2026-05-11 · unverdicted · novelty 4.0

Life-logging video streams create an inevitable privacy-utility trade-off that is a foundational challenge for always-on AI systems.

citing papers explorer

Showing 2 of 2 citing papers.

SpeakerLLM: A Speaker-Specialized Audio-LLM for Speaker Understanding and Verification Reasoning cs.SD · 2026-05-14 · unverdicted · none · ref 3 · internal anchor
SpeakerLLM unifies speaker profiling, recording-condition understanding, and structured verification reasoning in an audio-LLM via a hierarchical tokenizer and decision traces.
Position: Life-Logging Video Streams Make the Privacy-Utility Trade-off Inevitable cs.CV · 2026-05-11 · unverdicted · none · ref 15 · internal anchor
Life-logging video streams create an inevitable privacy-utility trade-off that is a foundational challenge for always-on AI systems.

VisionClaw: Always-On AI Agents through Smart Glasses

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer