pith. sign in

arxiv: 2502.12231 · v2 · pith:NYVZZ6EEnew · submitted 2025-02-17 · 💻 cs.CV

PUGS: Zero-shot Physical Understanding with Gaussian Splatting

classification 💻 cs.CV
keywords gaussianphysicalpugssplattingunderstandingzero-shotchallengingduring
0
0 comments X
read the original abstract

Current robotic systems can understand the categories and poses of objects well. But understanding physical properties like mass, friction, and hardness, in the wild, remains challenging. We propose a new method that reconstructs 3D objects using the Gaussian splatting representation and predicts various physical properties in a zero-shot manner. We propose two techniques during the reconstruction phase: a geometry-aware regularization loss function to improve the shape quality and a region-aware feature contrastive loss function to promote region affinity. Two other new techniques are designed during inference: a feature-based property propagation module and a volume integration module tailored for the Gaussian representation. Our framework is named as zero-shot physical understanding with Gaussian splatting, or PUGS. PUGS achieves new state-of-the-art results on the standard benchmark of ABO-500 mass prediction. We provide extensive quantitative ablations and qualitative visualization to demonstrate the mechanism of our designs. We show the proposed methodology can help address challenging real-world grasping tasks. Our codes, data, and models are available at https://github.com/EverNorif/PUGS

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. FieryGS: In-the-Wild Fire Synthesis with Physics-Integrated Gaussian Splatting

    cs.GR 2026-04 unverdicted novelty 7.0

    FieryGS integrates LLM-based material reasoning, volumetric combustion simulation, and a unified renderer with 3D Gaussian Splatting to generate physically plausible and user-controllable fire in in-the-wild scenes.

  2. Physically Guided Visual Mass Estimation from a Single RGB Image

    cs.CV 2026-01 unverdicted novelty 7.0

    A method estimates mass from single RGB images by fusing depth-based volume cues with vision-language model density semantics via adaptive gating and separate regression heads trained on mass labels only.

  3. PhysGraph: A Physics-aware 3D Scene Graph for Perception and Reasoning

    cs.RO 2026-06 unverdicted novelty 5.0

    PhysGraph reconstructs object-centric 3D geometry from RGB-D, decomposes objects into parts, infers materials and articulations via visual reasoning, and reports SOTA results on semantic segmentation, multi-object mas...