From Digital to Physical: Digital Agents as Autonomous Coaches for Physical Intelligence

Bo Zhao; Chuan Wen; Genjia Liu; Jixian Wu; Qipeng Liu; Shanghang Zhang; Siheng Chen; Sixiang Chen; Weixin Li; Wenzhao Lian

arXiv: 2601.21570 · v2 · pith:D4XCIDOLnew · submitted 2026-01-29 · cs.AI · cs.RO

Zixing Lei, Genjia Liu, Yuanshuo Zhang, Qipeng Liu, Yuzhu Cai, Sixiang Chen, Jixian Wu, Yunhong Wang, Weixin Li, Chuan Wen, Bo Zhao, Shanghang Zhang, Wenzhao Lian, Siheng Chen

keywords: agents, embodied, autonomous, digital, engineering, environment, feedback, field

The field of Embodied AI is evolving rapidly toward general-purpose robotic systems, fueled by high-fidelity simulation and large-scale data collection. However, this scaling capability remains severely bottlenecked by a reliance on labor-intensive manual oversight, from intricate reward shaping to hyperparameter tuning across heterogeneous backends. Inspired by LLMs' success in software automation and scientific discovery, we introduce EmboCoach-Bench, a benchmark evaluating the capacity of LLM agents to autonomously engineer embodied policies. Spanning 32 expert-curated RL and IL tasks, our framework posits executable code as the universal interface. We move beyond static generation to assess a dynamic closed-loop workflow in which agents leverage environment feedback to iteratively draft, debug, and optimize solutions, with improvements ranging from physics-informed reward design to policy architectures such as diffusion policies. Extensive evaluations yield three critical insights: (1) autonomous agents can qualitatively surpass human-engineered baselines by 26.5% in average success rate; (2) an agentic workflow with environment feedback effectively strengthens policy development and substantially narrows the performance gap between open-source and proprietary models; and (3) agents exhibit self-correction capabilities for pathological engineering cases, successfully resurrecting task performance from near-total failures through iterative simulation-in-the-loop debugging. Ultimately, this work establishes a foundation for self-evolving embodied intelligence, accelerating the paradigm shift from labor-intensive manual tuning to scalable, autonomous engineering in the embodied AI field.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

EvoMaster: A Foundational Evolving Agent Framework for Agentic Science at Scale
cs.AI · 2026-04 · unverdicted · novelty 5.0

EvoMaster is a self-evolving agent framework that achieves state-of-the-art results on scientific benchmarks by enabling iterative hypothesis refinement and knowledge accumulation across domains.