pith. sign in

arxiv: 2601.04126 · v3 · pith:TMJ7VDTZnew · submitted 2026-01-07 · 💻 cs.CL · cs.AI· cs.CV

InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training

classification 💻 cs.CL cs.AIcs.CV
keywords agentsenvironmentsinfinitewebsystemtrainingwebsiteagentchallenges
0
0 comments X
read the original abstract

GUI agents that interact with graphical interfaces on behalf of users represent a promising direction for practical AI assistants. However, training such agents is hindered by the scarcity of suitable environments. We present InfiniteWeb, a system that automatically generates functional web environments at scale for GUI agent training. While LLMs perform well on generating a single webpage, building a realistic and functional website with many interconnected pages faces challenges. We address these challenges through unified specification, task-centric test-driven development, and a combination of website seed with reference design image to ensure diversity. Our system also generates verifiable task evaluators enabling dense reward signals for reinforcement learning. Experiments show that InfiniteWeb surpasses commercial coding agents at realistic website construction, and GUI agents trained on our generated environments achieve significant performance improvements on OSWorld and Online-Mind2Web, demonstrating the effectiveness of proposed system.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 7 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents

    cs.AI 2026-05 conditional novelty 7.0

    CUA-Gym generates 32,112 verified RLVR tuples across 110 mock environments, enabling trained models to reach 62.1% and 72.6% on OSWorld-Verified while transferring to WebArena.

  2. PhoneWorld: Scaling Phone-Use Agent Environments

    cs.CL 2026-05 unverdicted novelty 6.0

    PhoneWorld is a pipeline that converts real mobile trajectories into scalable controllable environments, yielding large gains on four benchmarks when used to supplement training data.

  3. OpenComputer: Verifiable Software Worlds for Computer-Use Agents

    cs.AI 2026-05 unverdicted novelty 6.0

    OpenComputer introduces a verifier-grounded framework with state verifiers, self-evolving layers, task synthesis, and auditable evaluation for 33 desktop apps and 1000 tasks to support computer-use AI agents.

  4. Learning to Build the Environment: Self-Evolving Reasoning RL via Verifiable Environment Synthesis

    cs.AI 2026-05 unverdicted novelty 6.0

    EvoEnv lets a single policy synthesize, validate, and use Python environments with durable solve-verify asymmetry to improve reasoning performance on Qwen3-4B-Thinking from 72.4 to 74.8 while fixed-data baselines decline.

  5. Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

    cs.AI 2026-04 unverdicted novelty 6.0

    Agent-World autonomously synthesizes verifiable real-world tasks and uses continuous self-evolution to train 8B and 14B agents that outperform proprietary models on 23 benchmarks.

  6. Agentic Environment Engineering for Large Language Models: A Survey of Environment Modeling, Synthesis, Evaluation, and Application

    cs.CL 2026-06 unverdicted novelty 5.0

    This survey categorizes agentic environments for LLMs by eight attributes and domains, introduces symbolic and neural synthesis paradigms with evaluation, and outlines four agent evolution pathways plus three environm...

  7. Scalable Environments Drive Generalizable Agents

    cs.AI 2026-05 unverdicted novelty 5.0

    Generalizable agents require environment scaling via diverse executable rule-sets, distinguished from trajectory and task scaling in a new taxonomy.