Browseconf: Confidence-guided test-time scaling for web agents,

· 2025 · arXiv 2510.23458

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Mobile-Aptus: Confidence-Driven Proactive and Robust Interaction in MLLM-based Mobile-Using Agents

cs.CL · 2026-05-27 · unverdicted · novelty 5.0

Mobile-Aptus uses supervised fine-tuning followed by semantic similarity retrieval and direct preference optimization to calibrate confidence scores in mobile agents, yielding over 17% average task success improvement on four benchmarks.

citing papers explorer

Showing 1 of 1 citing paper.

Mobile-Aptus: Confidence-Driven Proactive and Robust Interaction in MLLM-based Mobile-Using Agents cs.CL · 2026-05-27 · unverdicted · none · ref 33
Mobile-Aptus uses supervised fine-tuning followed by semantic similarity retrieval and direct preference optimization to calibrate confidence scores in mobile agents, yielding over 17% average task success improvement on four benchmarks.

Browseconf: Confidence-guided test-time scaling for web agents,

fields

years

verdicts

representative citing papers

citing papers explorer