ISBN 9798400704901

Zihan Dong, Xinyu Fan, Zhiyuan Peng · 2024 · arXiv 7528.367162

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

Mango: Multi-Agent Web Navigation via Global-View Optimization

cs.CL · 2026-04-20 · unverdicted · novelty 6.0

Mango raises web agent success rates to 63.6% on WebVoyager and 52.5% on WebWalkerQA by bandit-based starting-point selection and memory, beating baselines by 7.3% and 26.8%.

ContractSkill: Repairable Contract-Based Skills for Multimodal Web Agents

cs.SE · 2026-03-20 · unverdicted · novelty 6.0

ContractSkill converts draft web agent skills into explicit executable contracts that enable deterministic verification, fault localization, and minimal local repair, improving stability on benchmarks like VisualWebArena.

DynaWeb: Model-Based Reinforcement Learning of Web Agents

cs.CL · 2026-01-29 · unverdicted · novelty 6.0

DynaWeb introduces a model-based RL framework that trains web agents via imagined rollouts in a learned web world model interleaved with real expert trajectories, yielding consistent gains on WebArena and WebVoyager benchmarks.

BacktestBench: Benchmarking Large Language Models for Automated Quantitative Strategy Backtesting

cs.CL · 2026-05-18

citing papers explorer

Showing 4 of 4 citing papers.

Mango: Multi-Agent Web Navigation via Global-View Optimization cs.CL · 2026-04-20 · unverdicted · none · ref 1
Mango raises web agent success rates to 63.6% on WebVoyager and 52.5% on WebWalkerQA by bandit-based starting-point selection and memory, beating baselines by 7.3% and 26.8%.
ContractSkill: Repairable Contract-Based Skills for Multimodal Web Agents cs.SE · 2026-03-20 · unverdicted · none · ref 13
ContractSkill converts draft web agent skills into explicit executable contracts that enable deterministic verification, fault localization, and minimal local repair, improving stability on benchmarks like VisualWebArena.
DynaWeb: Model-Based Reinforcement Learning of Web Agents cs.CL · 2026-01-29 · unverdicted · none · ref 34
DynaWeb introduces a model-based RL framework that trains web agents via imagined rollouts in a learned web world model interleaved with real expert trajectories, yielding consistent gains on WebArena and WebVoyager benchmarks.
BacktestBench: Benchmarking Large Language Models for Automated Quantitative Strategy Backtesting cs.CL · 2026-05-18 · unreviewed · ref 7

ISBN 9798400704901

fields

years

verdicts

representative citing papers

citing papers explorer