pith. sign in

arXiv preprint arXiv:2012.11543 , year=

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.LG 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

Sample-Efficient Post-Training for LEGO Spatial-Physics Reasoning

cs.LG · 2026-05-29 · unverdicted · novelty 5.0

PVPO is a sample-efficient RL method that improves semantic, geometric, and physical quality in LLM LEGO assembly generation by mitigating the PhysHack failure mode where validity alone fails to ensure fidelity.

citing papers explorer

Showing 1 of 1 citing paper.

  • Sample-Efficient Post-Training for LEGO Spatial-Physics Reasoning cs.LG · 2026-05-29 · unverdicted · none · ref 22

    PVPO is a sample-efficient RL method that improves semantic, geometric, and physical quality in LLM LEGO assembly generation by mitigating the PhysHack failure mode where validity alone fails to ensure fidelity.