BrowseConf: Confidence-guided test-time scaling for web agents

2025 · arXiv 2510.23458

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Uncertainty Decomposition for Clarification Seeking in LLM Agents

cs.AI · 2026-06-17 · unverdicted · novelty 6.0

A prompt-based uncertainty decomposition separates action confidence from request uncertainty to enable clarification seeking in LLM agents, yielding F1 gains of 73% and 36% over baselines on two new underspecified benchmarks across five models.

Mobile-Aptus: Confidence-Driven Proactive and Robust Interaction in MLLM-based Mobile-Using Agents

cs.CL · 2026-05-27 · unverdicted · novelty 5.0

Mobile-Aptus uses supervised fine-tuning followed by semantic similarity retrieval and direct preference optimization to calibrate confidence scores in mobile agents, yielding over 17% average task success improvement on four benchmarks.

citing papers explorer


Uncertainty Decomposition for Clarification Seeking in LLM Agents · cs.AI · 2026-06-17 · unverdicted · none · ref 34
Mobile-Aptus: Confidence-Driven Proactive and Robust Interaction in MLLM-based Mobile-Using Agents · cs.CL · 2026-05-27 · unverdicted · none · ref 33
