In: Proceedings of the AAAI Conference on Artificial Intelligence

Hang Hua, Yolo Yunlong Tang, Chenliang Xu, Jiebo Luo · 2025 · DOI 10.1609/aaai.v39i4.32374

2 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

Agent Skills Should Go Beyond Text: The Case for Visual Skills

cs.CV · 2026-05-31 · unverdicted · novelty 5.0

The paper proposes that reusable agent skills should incorporate visual elements alongside text, introduces three forms of visual skills and an automatic conversion system, and reports better performance on GUI and visual-centric tasks.

LVSum: A Benchmark for Timestamp-Aware Long Video Summarization

cs.CV · 2026-04-11 · unverdicted · novelty 5.0

LVSum is a new benchmark for timestamp-aware long video summarization that exposes systematic temporal gaps in existing multimodal large language models.

citing papers explorer


Agent Skills Should Go Beyond Text: The Case for Visual Skills · cs.CV · 2026-05-31 · unverdicted · none · ref 18
The paper proposes that reusable agent skills should incorporate visual elements alongside text, introduces three forms of visual skills and an automatic conversion system, and reports better performance on GUI and visual-centric tasks.
LVSum: A Benchmark for Timestamp-Aware Long Video Summarization · cs.CV · 2026-04-11 · unverdicted · none · ref 10
LVSum is a new benchmark for timestamp-aware long video summarization that exposes systematic temporal gaps in existing multimodal large language models.
