Title resolution pending

The task description

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Android Coach: Improve Online Agentic Training Efficiency with Single State Multiple Actions

cs.LG · 2026-04-08 · unverdicted · novelty 7.0 · 2 refs

Android Coach improves online agent training efficiency by enabling multiple actions per state via a critic-based coach, process reward model, and group-wise advantage estimation, delivering 7.5-8.3% success rate gains and 1.4x efficiency over PPO/GRPO baselines.

Step-level Optimization for Efficient Computer-use Agents

cs.AI · 2026-04-29 · unverdicted · novelty 6.0

A modular step-level cascade uses Stuck and Milestone monitors to switch between small and large policies in computer-use agents, turning uniform expensive inference into on-demand allocation.

citing papers explorer

Showing 2 of 2 citing papers.

Android Coach: Improve Online Agentic Training Efficiency with Single State Multiple Actions cs.LG · 2026-04-08 · unverdicted · none · ref 6 · 2 links
Android Coach improves online agent training efficiency by enabling multiple actions per state via a critic-based coach, process reward model, and group-wise advantage estimation, delivering 7.5-8.3% success rate gains and 1.4x efficiency over PPO/GRPO baselines.
Step-level Optimization for Efficient Computer-use Agents cs.AI · 2026-04-29 · unverdicted · none · ref 7
A modular step-level cascade uses Stuck and Milestone monitors to switch between small and large policies in computer-use agents, turning uniform expensive inference into on-demand allocation.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer