Open captchaworld: A comprehensive web-based platform for testing and benchmarking multimodal llm agents

Yaxin Luo, Zhaoyi Li, Jiacheng Liu, Jiacheng Cui, Xiaohan Zhao, Zhiqiang Shen · 2025 · arXiv 2505.24878

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

citation-role summary

dataset 1

citation-polarity summary

use dataset 1

representative citing papers

COGNITION: From Evaluation to Defense against Multimodal LLM CAPTCHA Solvers

cs.CR · 2025-12-02 · conditional · novelty 6.0

Multimodal LLMs reliably solve many CAPTCHA tasks but can be defended by adding fine-grained localization and implicit counting that drops state-of-the-art success from over 95% to 0%.

Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges

cs.AI · 2025-10-27 · unverdicted · novelty 4.0

A survey that taxonomizes threats to agentic AI, reviews benchmarks and evaluation methods, discusses technical and governance defenses, and identifies open challenges.

citing papers explorer

Showing 2 of 2 citing papers.

COGNITION: From Evaluation to Defense against Multimodal LLM CAPTCHA Solvers cs.CR · 2025-12-02 · conditional · none · ref 14
Multimodal LLMs reliably solve many CAPTCHA tasks but can be defended by adding fine-grained localization and implicit counting that drops state-of-the-art success from over 95% to 0%.
Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges cs.AI · 2025-10-27 · unverdicted · none · ref 167
A survey that taxonomizes threats to agentic AI, reviews benchmarks and evaluation methods, discusses technical and governance defenses, and identifies open challenges.

Open captchaworld: A comprehensive web-based platform for testing and benchmarking multimodal llm agents

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer