pith. machine review for the scientific record. sign in

arxiv: 2512.09923 · v2 · submitted 2025-12-10 · 💻 cs.CV

Recognition: unknown

Splatent: Splatting Diffusion Latents for Novel View Synthesis

Eli Alshan, Frederic Devernay, Ianir Ideses, Inbar Huberman-Spiegelglas, Lior Fritz, Netalee Efrat, Omer Sela, Or Hirschorn, Yochai Zvik

Authors on Pith no claims yet
classification 💻 cs.CV
keywords reconstructionlatentspacesplatentdetailsdiffusionvaesdetail
0
0 comments X
read the original abstract

Radiance field representations have recently been explored in the latent space of VAEs that are commonly used by diffusion models. This direction offers efficient rendering and seamless integration with diffusion-based pipelines. However, these methods face a fundamental limitation: The VAE latent space lacks multi-view consistency, leading to blurred textures and missing details during 3D reconstruction. Existing approaches attempt to address this by fine-tuning the VAE, at the cost of reconstruction quality, or by relying on pre-trained diffusion models to recover fine-grained details, at the risk of some hallucinations. We present Splatent, a diffusion-based enhancement framework designed to operate on top of 3D Gaussian Splatting (3DGS) in the latent space of VAEs. Our key insight departs from the conventional 3D-centric view: rather than reconstructing fine-grained details in 3D space, we recover them in 2D from input views through multi-view attention mechanisms. This approach preserves the reconstruction quality of pretrained VAEs while achieving faithful detail recovery. Evaluated across multiple benchmarks, Splatent establishes a new state-of-the-art for VAE latent radiance field reconstruction. We further demonstrate that integrating our method with existing feed-forward frameworks, consistently improves detail preservation, opening new possibilities for high-quality sparse-view 3D reconstruction. Code is available on our project page: https://orhir.github.io/Splatent/

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. GeoQuery: Geometry-Query Diffusion for Sparse-View Reconstruction

    cs.CV 2026-05 unverdicted novelty 6.0

    GeoQuery replaces corrupted rendering features with geometry-aligned proxy queries and restricts cross-view attention to local windows, enabling robust diffusion-based refinement under extreme view sparsity.