Speed3r: Sparse feed-forward 3d reconstruction models

Weining Ren, Xiao Tan, Kai Han · 2026 · arXiv 2603.08055

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

VGGT-Edit: Feed-forward Native 3D Scene Editing with Residual Field Prediction

cs.CV · 2026-05-14 · unverdicted · novelty 7.0

VGGT-Edit proposes a native 3D text-conditioned editing framework using depth-synchronized injection and residual field prediction, plus the DeltaScene dataset, outperforming 2D-lifting methods.

TurboVGGT: Fast Visual Geometry Reconstruction with Adaptive Alternating Attention

cs.CV · 2026-05-14 · unverdicted · novelty 7.0

TurboVGGT uses adaptive sparse global attention with varying sparsity levels across frames and layers plus frame attention to enable faster multi-view 3D reconstruction while keeping competitive quality versus prior state-of-the-art methods.

Feed-Forward 3D Scene Modeling: A Problem-Driven Perspective

cs.CV · 2026-04-15 · unverdicted · novelty 6.0

The paper proposes a problem-driven taxonomy for feed-forward 3D scene modeling that groups methods by five core challenges: feature enhancement, geometry awareness, model efficiency, augmentation strategies, and temporal-aware modeling.

Lite3R: A Model-Agnostic Framework for Efficient Feed-Forward 3D Reconstruction

cs.CV · 2026-05-12 · unverdicted · novelty 5.0

Lite3R cuts latency by 1.7-2.0x and memory by 1.9-2.4x in feed-forward 3D reconstruction using sparse linear attention and FP8-aware quantization-aware training while keeping competitive quality on backbones like VGGT and DA3-Large.

citing papers explorer

Showing 4 of 4 citing papers.

VGGT-Edit: Feed-forward Native 3D Scene Editing with Residual Field Prediction cs.CV · 2026-05-14 · unverdicted · none · ref 9
VGGT-Edit proposes a native 3D text-conditioned editing framework using depth-synchronized injection and residual field prediction, plus the DeltaScene dataset, outperforming 2D-lifting methods.
TurboVGGT: Fast Visual Geometry Reconstruction with Adaptive Alternating Attention cs.CV · 2026-05-14 · unverdicted · none · ref 30
TurboVGGT uses adaptive sparse global attention with varying sparsity levels across frames and layers plus frame attention to enable faster multi-view 3D reconstruction while keeping competitive quality versus prior state-of-the-art methods.
Feed-Forward 3D Scene Modeling: A Problem-Driven Perspective cs.CV · 2026-04-15 · unverdicted · none · ref 170
The paper proposes a problem-driven taxonomy for feed-forward 3D scene modeling that groups methods by five core challenges: feature enhancement, geometry awareness, model efficiency, augmentation strategies, and temporal-aware modeling.
Lite3R: A Model-Agnostic Framework for Efficient Feed-Forward 3D Reconstruction cs.CV · 2026-05-12 · unverdicted · none · ref 29
Lite3R cuts latency by 1.7-2.0x and memory by 1.9-2.4x in feed-forward 3D reconstruction using sparse linear attention and FP8-aware quantization-aware training while keeping competitive quality on backbones like VGGT and DA3-Large.

Speed3r: Sparse feed-forward 3d reconstruction models

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer