pith. sign in

arxiv: 2607.02301 · v1 · pith:EJFY6OBKnew · submitted 2026-07-02 · 💻 cs.CV

InvSplat: Inverse Feed-Forward Scene Splatting

Pith reviewed 2026-07-03 15:29 UTC · model grok-4.3

classification 💻 cs.CV
keywords inverse rendering3D Gaussian splattingfeed-forward reconstructionmaterial estimationnovel view synthesisphysically based renderingmulti-view consistencyrelighting
0
0 comments X

The pith

A feed-forward model predicts 3D Gaussians carrying albedo, metallic, and roughness values directly from multi-view images.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a reconstruction framework that outputs an explicit 3D Gaussian scene model augmented with intrinsic material parameters in one forward pass. It fuses priors from a material estimation network into a multi-view reconstruction backbone so that geometry and reflectance are recovered jointly rather than through separate optimization. This produces a disentangled representation that supports relighting and novel-view synthesis while improving consistency across views compared with image-space baselines. The approach targets the gap between slow per-scene inverse-rendering methods and fast but inconsistent 2D learning approaches.

Core claim

InvSplat directly predicts a structured 3D Gaussian representation in which each primitive is defined by mean, normal, opacity, rotation, scale, albedo, metallic, and roughness. By integrating material-estimation priors with the multi-view backbone, the model performs joint prediction of geometry and reflectance parameters in a single forward pass, yielding multi-view consistent results, accurate material recovery, and stable novel-view rendering on both synthetic and real datasets.

What carries the argument

The 3D Gaussian primitive extended with intrinsic material attributes (albedo, metallic, roughness) that encodes both geometry and physically based reflectance.

If this is right

  • Physically based relighting becomes feasible from the recovered material parameters.
  • Novel-view images remain stable because the representation is explicitly 3D rather than image-space.
  • Multi-view consistency improves over pure 2D learning baselines.
  • View-dependent effects are modeled more faithfully than with RGB-only feed-forward methods.
  • Material recovery accuracy holds on both synthetic and real-world test sets.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The feed-forward design could extend inverse rendering to video or large-scale scenes where per-scene optimization is prohibitive.
  • If the fusion strategy generalizes, similar material-augmented primitives might be added to other explicit 3D representations.
  • Real-time relighting pipelines in graphics applications could adopt the same single-pass prediction once the backbone is trained.

Load-bearing premise

The material estimation network priors remain accurate and compatible when fused inside the multi-view reconstruction backbone.

What would settle it

Rendered relighting results on a held-out scene that deviate measurably from ground-truth illumination changes while the geometry appears correct.

Figures

Figures reproduced from arXiv: 2607.02301 by Andreas Geiger, Haofei Xu, Hendrik Lensch, Polina Karpikova, Wenjing Bian.

Figure 1
Figure 1. Figure 1: InvSplat Overview. Given a set of posed images, InvSplat reconstructs both the 3D scene geometry and material parameters in real time, enabling novel view synthesis and relighting. Abstract Inverse rendering aims to recover both 3D geometry and physically meaningful material properties from images, enabling applications such as relighting and novel view synthesis. Optimization-based methods achieve high fi… view at source ↗
Figure 2
Figure 2. Figure 2: Method Overview. Our feed-forward multi-view model predicts a physically based 3D Gaussian scene representation (geometry + material parameters) and enforces cross-view consistency through differentiable rendering. is associated with diffuse albedo aj ∈ [0, 1]3 , metallicity mj ∈ [0, 1], and roughness rj ∈ [0, 1]. We further augment each Gaussian with a surface normal nj ∈ R 3 , which enables high-quality … view at source ↗
Figure 3
Figure 3. Figure 3: Qualitative reconstruction results on InteriorVerse. [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Generalization to real-world scenes from RealEstate10K. For each of the three scenes, we [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Multi-view material consistency on a scene from Structured3D. For each method, the figure [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Generalization to a real-world DL3DV scene with four input views. The first two columns [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Material/lighting editing on an Infinigen scene. First row Infinigen scene, in second row [PITH_FULL_IMAGE:figures/full_fig_p009_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Alternative derivation for gaus￾sian normals. First row: input image, normals derived from depth. Second row: rendered normals, left is separate head for prediction, right is prediction in gaussian head. We also ablate the normal prediction branch and test two other variants: one in which we directly compute normals from depth using finite differences, and another in which we predict normals from the Gauss… view at source ↗
Figure 9
Figure 9. Figure 9: Failure case example. Our model inherits the limitations of 2 domains. From the feed-forward scene reconstruction side, if poses are estimated incorrectly, our reconstruction will produce cor￾rupted results, as illustrated in [PITH_FULL_IMAGE:figures/full_fig_p013_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: RGB reconstruction comparison on a synthetic Infinigen scene across two views (rows) [PITH_FULL_IMAGE:figures/full_fig_p014_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: RGB reconstruction comparison on DL3DV across four views (rows) and methods [PITH_FULL_IMAGE:figures/full_fig_p014_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Qualitative results on three DL3DV scenes with 2 input views each. For every scene the [PITH_FULL_IMAGE:figures/full_fig_p015_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Qualitative results on three DL3DV scenes with 4 input views each. For every scene the [PITH_FULL_IMAGE:figures/full_fig_p016_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: Qualitative albedo comparison on InteriorVerse. Each row shows one view of a scene. [PITH_FULL_IMAGE:figures/full_fig_p017_14.png] view at source ↗
Figure 15
Figure 15. Figure 15: Qualitative metallic and roughness comparison on InteriorVerse for the same 3 scenes [PITH_FULL_IMAGE:figures/full_fig_p018_15.png] view at source ↗
Figure 16
Figure 16. Figure 16: Multi-view consistency on additional examples from Structured3D for albedo, metallic [PITH_FULL_IMAGE:figures/full_fig_p019_16.png] view at source ↗
read the original abstract

Inverse rendering aims to recover both 3D geometry and physically meaningful material properties from images, enabling applications such as relighting and novel view synthesis. Optimization-based methods achieve high fidelity but require costly per-scene fitting, while image-space learning-based approaches often suffer from multi-view inconsistencies and lack an explicit 3D representation for stable novel view rendering. We present a feed-forward multi-view reconstruction framework for inverse rendering that directly predicts a structured 3D Gaussian representation with intrinsic material attributes. Each Gaussian primitive is parameterized by mean, normal, opacity, rotation, scale, albedo, metallic, and roughness, enabling a disentangled and physically grounded scene representation. Our model integrates priors from a material estimation network with a multi-view 3D reconstruction backbone, allowing joint prediction of geometry and reflectance parameters in a single forward pass. Experiments on synthetic and real-world datasets demonstrate improved multi-view consistency compared to 2D baselines, accurate material recovery, and stable novel view rendering. Our representation further supports physically-based relighting and more faithful modeling of view-dependent effects compared to existing RGB-based feed-forward reconstruction methods. Our project webpage is: $\href{https://poliik.github.io/invsplat/}{\text{https://poliik.github.io/invsplat/}}$.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper presents InvSplat, a feed-forward multi-view reconstruction framework for inverse rendering. It directly predicts a structured 3D Gaussian representation in which each primitive is parameterized by mean, normal, opacity, rotation, scale, albedo, metallic, and roughness. The central claim is that integrating priors from a material estimation network with a multi-view 3D reconstruction backbone enables joint prediction of geometry and reflectance parameters in a single forward pass, yielding improved multi-view consistency, accurate material recovery, stable novel-view rendering, and support for physically-based relighting on synthetic and real datasets.

Significance. A working feed-forward method that produces an explicit, disentangled 3D Gaussian representation with intrinsic material attributes would be a meaningful step beyond both per-scene optimization pipelines and purely image-space learning approaches, particularly if it delivers consistent geometry and reflectance without requiring post-hoc fitting.

major comments (2)
  1. [Abstract] Abstract: the claim that the model 'integrates priors from a material estimation network with a multi-view 3D reconstruction backbone, allowing joint prediction of geometry and reflectance parameters in a single forward pass' is load-bearing for the entire contribution, yet the abstract supplies no description of the fusion architecture, conditioning mechanism, joint loss terms, or regularization that would prevent one branch from corrupting the other. Without these details the compatibility assumption remains unverified.
  2. [Abstract] Abstract: no quantitative tables, error metrics, ablation studies, dataset descriptions, or baseline comparisons are referenced, so the stated improvements in multi-view consistency and material recovery cannot be assessed from the provided text.
minor comments (1)
  1. [Abstract] The project webpage URL is given but the manuscript does not indicate whether code or trained models will be released.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on the abstract. We address each major comment below, clarifying the role of the abstract versus the full manuscript and proposing targeted revisions where they strengthen the presentation without altering the core contribution.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the claim that the model 'integrates priors from a material estimation network with a multi-view 3D reconstruction backbone, allowing joint prediction of geometry and reflectance parameters in a single forward pass' is load-bearing for the entire contribution, yet the abstract supplies no description of the fusion architecture, conditioning mechanism, joint loss terms, or regularization that would prevent one branch from corrupting the other. Without these details the compatibility assumption remains unverified.

    Authors: We agree that the abstract is high-level and does not enumerate the technical mechanisms. The fusion architecture (cross-attention between material and geometry branches), conditioning (material features injected into the Gaussian decoder), joint loss formulation (combined reconstruction, material, and consistency terms), and regularization (disentanglement penalties) are fully specified in Section 3. To address the concern, we will revise the abstract to include one additional sentence outlining the high-level integration strategy and the use of joint training objectives that enforce compatibility. revision: yes

  2. Referee: [Abstract] Abstract: no quantitative tables, error metrics, ablation studies, dataset descriptions, or baseline comparisons are referenced, so the stated improvements in multi-view consistency and material recovery cannot be assessed from the provided text.

    Authors: Abstracts are space-constrained and conventionally omit tables, specific metrics, and detailed experimental descriptions; these elements appear in Section 4 (Experiments), including quantitative tables, ablation studies, dataset specifications, and baseline comparisons that substantiate the claims of improved multi-view consistency and material recovery. We therefore do not believe the abstract requires expansion to include such details, as doing so would violate length guidelines and duplicate content already present in the body of the paper. revision: no

Circularity Check

0 steps flagged

No circularity detected in derivation chain

full rationale

The provided abstract and description contain no equations, fitted parameters, self-citations, or derivation steps that reduce to inputs by construction. The model is described as integrating priors from a material estimation network with a multi-view backbone for joint prediction, but this is presented as an architectural choice without any self-definitional, fitted-input, or uniqueness-imported circularity. No load-bearing claims rely on prior self-work in a way that collapses the result. The derivation is self-contained against external benchmarks as an empirical feed-forward method.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; therefore no concrete free parameters, axioms, or invented entities can be extracted beyond the high-level parameterization listed in the abstract.

pith-pipeline@v0.9.1-grok · 5761 in / 1221 out tokens · 19023 ms · 2026-07-03T15:29:30.489016+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

34 extracted references · 8 canonical work pages · 3 internal anchors

  1. [1]

    Gs-ir: 3d gaussian splatting for inverse rendering

    Zhihao Liang, Qi Zhang, Ying Feng, Ying Shan, and Kui Jia. Gs-ir: 3d gaussian splatting for inverse rendering. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 21644–21653, 2024

  2. [2]

    Iris: Inverse rendering of indoor scenes from low dynamic range images

    Chih-Hao Lin, Jia-Bin Huang, Zhengqin Li, Zhao Dong, Christian Richardt, Tuotuo Li, Michael Zollhöfer, Johannes Kopf, Shenlong Wang, and Changil Kim. Iris: Inverse rendering of indoor scenes from low dynamic range images. InCVPR, 2025

  3. [3]

    Nerfactor: Neural factorization of shape and reflectance under an unknown illumination.ACM Transactions on Graphics (ToG), 40(6):1–18, 2021

    Xiuming Zhang, Pratul P Srinivasan, Boyang Deng, Paul Debevec, William T Freeman, and Jonathan T Barron. Nerfactor: Neural factorization of shape and reflectance under an unknown illumination.ACM Transactions on Graphics (ToG), 40(6):1–18, 2021

  4. [4]

    Inverse path tracing for joint material and lighting estimation

    Dejan Azinovic, Tzu-Mao Li, Anton Kaplanyan, and Matthias Nießner. Inverse path tracing for joint material and lighting estimation. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2447–2456, 2019

  5. [5]

    Nerd: Neural reflectance decomposition from image collections

    Mark Boss, Raphael Braun, Varun Jampani, Jonathan T Barron, Ce Liu, and Hendrik Lensch. Nerd: Neural reflectance decomposition from image collections. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 12684–12694, 2021

  6. [6]

    Learning intrinsic image decomposition from watching the world

    Zhengqi Li and Noah Snavely. Learning intrinsic image decomposition from watching the world. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 9039–9048, 2018

  7. [7]

    Inverse rendering for complex indoor scenes: Shape, spatially-varying lighting and svbrdf from a single image

    Zhengqin Li, Mohammad Shafiei, Ravi Ramamoorthi, Kalyan Sunkavalli, and Manmohan Chandraker. Inverse rendering for complex indoor scenes: Shape, spatially-varying lighting and svbrdf from a single image. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2475–2484, 2020

  8. [8]

    Learning-based inverse rendering of complex indoor scenes with differentiable monte carlo raytracing

    Jingsen Zhu, Fujun Luan, Yuchi Huo, Zihao Lin, Zhihua Zhong, Dianbing Xi, Rui Wang, Hujun Bao, Jiaxi- ang Zheng, and Rui Tang. Learning-based inverse rendering of complex indoor scenes with differentiable monte carlo raytracing. InSiggraph asia 2022 conference papers, pages 1–8, 2022

  9. [9]

    Mvinverse: Feed-forward multi-view inverse rendering in seconds.arXiv preprint arXiv:2512.21003, 2025

    Xiangzuo Wu, Chengwei Ren, Jun Zhou, Xiu Li, and Yuan Liu. Mvinverse: Feed-forward multi-view inverse rendering in seconds.arXiv preprint arXiv:2512.21003, 2025

  10. [10]

    Diffusionrenderer: Neural inverse and forward rendering with video diffusion models

    Ruofan Liang, Zan Gojcic, Huan Ling, Jacob Munkberg, Jon Hasselgren, Zhi-Hao Lin, Jun Gao, Alexander Keller, Nandita Vijaykumar, Sanja Fidler, and Zian Wang. Diffusionrenderer: Neural inverse and forward rendering with video diffusion models. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2025

  11. [11]

    Dnf-intrinsic: Deterministic noise-free diffusion for indoor inverse rendering.arXiv preprint arXiv:2507.03924, 2025

    Rongjia Zheng, Qing Zhang, Chengjiang Long, and Wei-Shi Zheng. Dnf-intrinsic: Deterministic noise-free diffusion for indoor inverse rendering.arXiv preprint arXiv:2507.03924, 2025. Accepted to ICCV 2025

  12. [12]

    Worldmirror: Universal 3d world reconstruction with any-prior prompting.arXiv preprint arXiv:2510.10726, 2025

    Yifan Liu, Zhiyuan Min, Zhenwei Wang, Junta Wu, Tengfei Wang, Yixuan Yuan, Yawei Luo, and Chunchao Guo. Worldmirror: Universal 3d world reconstruction with any-prior prompting.arXiv preprint arXiv:2510.10726, 2025

  13. [13]

    Depth Anything 3: Recovering the Visual Space from Any Views

    Haotong Lin, Sili Chen, Jun Hao Liew, Donny Y . Chen, Zhenyu Li, Guang Shi, Jiashi Feng, and Bingyi Kang. Depth anything 3: recovering the visual space from any views.arXiv preprint arXiv:2511.10647, 2025

  14. [14]

    Vggt: Visual geometry grounded transformer

    Jianyuan Wang, Minghao Chen, Nikita Karaev, Andrea Vedaldi, Christian Rupprecht, and David Novotny. Vggt: Visual geometry grounded transformer. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

  15. [15]

    Dust3r: Geometric 3d vision made easy

    Shuzhe Wang, Vincent Leroy, Yohann Cabon, Boris Chidlovskii, and Jerome Revaud. Dust3r: Geometric 3d vision made easy. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 20697–20709, June 2024

  16. [16]

    arXiv preprint arXiv:2510.08575 (2025)

    Haofei Xu, Daniel Barath, Andreas Geiger, and Marc Pollefeys. Resplat: Learning recurrent gaussian splats.arXiv preprint arXiv:2510.08575, 2025

  17. [17]

    Depthsplat: Connecting gaussian splatting and depth

    Haofei Xu, Songyou Peng, Fangjinhua Wang, Hermann Blum, Daniel Barath, Andreas Geiger, and Marc Pollefeys. Depthsplat: Connecting gaussian splatting and depth. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 16453–16463, 2025. 10

  18. [18]

    Intrinsic image fusion for multi-view 3d material reconstruction

    Peter Kocsis, Lukas Höllein, and Matthias Nießner. Intrinsic image fusion for multi-view 3d material reconstruction. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026

  19. [19]

    3d gaussian splatting for real-time radiance field rendering.ACM Trans

    Bernhard Kerbl, Georgios Kopanas, Thomas Leimkühler, George Drettakis, et al. 3d gaussian splatting for real-time radiance field rendering.ACM Trans. Graph., 42(4):139–1, 2023

  20. [20]

    pixelsplat: 3d gaussian splats from image pairs for scalable generalizable 3d reconstruction

    David Charatan, Sizhe Lester Li, Andrea Tagliasacchi, and Vincent Sitzmann. pixelsplat: 3d gaussian splats from image pairs for scalable generalizable 3d reconstruction. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 19457–19467, 2024

  21. [21]

    Mvsplat: Efficient 3d gaussian splatting from sparse multi-view images

    Yuedong Chen, Haofei Xu, Chuanxia Zheng, Bohan Zhuang, Marc Pollefeys, Andreas Geiger, Tat-Jen Cham, and Jianfei Cai. Mvsplat: Efficient 3d gaussian splatting from sparse multi-view images. In European conference on computer vision, pages 370–386. Springer, 2024

  22. [22]

    No pose, no problem: Surprisingly simple 3d gaussian splats from sparse unposed images.arXiv preprint arXiv:2410.24207, 2024

    Botao Ye, Sifei Liu, Haofei Xu, Xueting Li, Marc Pollefeys, Ming-Hsuan Yang, and Songyou Peng. No pose, no problem: Surprisingly simple 3d gaussian splats from sparse unposed images.arXiv preprint arXiv:2410.24207, 2024

  23. [23]

    Deep residual learning for image recognition

    Kaiming He, Xiangyu Zhang, Shaoquing Ren, and Jian Sun. Deep residual learning for image recognition. 2016 ieee conf. InComput. Vis. Pattern Recognit, pages 770–778, 2015

  24. [24]

    DINOv2: Learning Robust Visual Features without Supervision

    Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy V o, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, et al. Dinov2: Learning robust visual features without supervision.arXiv preprint arXiv:2304.07193, 2023

  25. [25]

    Towards robust monocular depth estimation: Mixing datasets for zero-shot cross-dataset transfer.IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020

    René Ranftl, Katrin Lasinger, David Hafner, Konrad Schindler, and Vladlen Koltun. Towards robust monocular depth estimation: Mixing datasets for zero-shot cross-dataset transfer.IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020

  26. [26]

    Point transformer

    Hengshuang Zhao, Li Jiang, Jiaya Jia, Philip HS Torr, and Vladlen Koltun. Point transformer. In Proceedings of the IEEE/CVF international conference on computer vision, pages 16259–16268, 2021

  27. [27]

    Moge: Unlocking accurate monocular geometry estimation for open-domain images with optimal training supervision

    Ruicheng Wang, Sicheng Xu, Cassie Dai, Jianfeng Xiang, Yu Deng, Xin Tong, and Jiaolong Yang. Moge: Unlocking accurate monocular geometry estimation for open-domain images with optimal training supervision. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 5261–5271, 2025

  28. [28]

    Structured3d: A large photo-realistic dataset for structured 3d modeling

    Jia Zheng, Junfei Zhang, Jing Li, Rui Tang, Shenghua Gao, and Zihan Zhou. Structured3d: A large photo-realistic dataset for structured 3d modeling. InEuropean Conference on Computer Vision, pages 519–535. Springer, 2020

  29. [29]

    Infinigen indoors: Photorealistic indoor scenes using procedural generation

    Alexander Raistrick, Lingjie Mei, Karhan Kayan, David Yan, Yiming Zuo, Beining Han, Hongyu Wen, Meenal Parakh, Stamatis Alexandropoulos, Lahav Lipson, Zeyu Ma, and Jia Deng. Infinigen indoors: Photorealistic indoor scenes using procedural generation. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 21783–21...

  30. [30]

    Stereo Magnification: Learning View Synthesis using Multiplane Images

    Tinghui Zhou, Richard Tucker, John Flynn, Graham Fyffe, and Noah Snavely. Stereo magnification: Learning view synthesis using multiplane images.arXiv preprint arXiv:1805.09817, 2018

  31. [31]

    Dl3dv-10k: A large-scale scene dataset for deep learning-based 3d vision

    Lu Ling, Yichen Sheng, Zhi Tu, Wentian Zhao, Cheng Xin, Kun Wan, Lantao Yu, Qianyu Guo, Zixun Yu, Yawen Lu, et al. Dl3dv-10k: A large-scale scene dataset for deep learning-based 3d vision. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 22160–22169, 2024

  32. [32]

    Physically-based shading at disney

    Brent Burley and Walt Disney Animation Studios. Physically-based shading at disney. InAcm siggraph, volume 2012, pages 1–7. vol. 2012, 2012

  33. [33]

    Structure-from-motion revisited

    Johannes L Schonberger and Jan-Michael Frahm. Structure-from-motion revisited. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 4104–4113, 2016

  34. [34]

    Optix: a general purpose ray tracing engine.Acm transactions on graphics (tog), 29(4):1–13, 2010

    Steven G Parker, James Bigler, Andreas Dietrich, Heiko Friedrich, Jared Hoberock, David Luebke, David McAllister, Morgan McGuire, Keith Morley, Austin Robison, et al. Optix: a general purpose ray tracing engine.Acm transactions on graphics (tog), 29(4):1–13, 2010. 11 A Supplementary A.1 Architecture details Although the geometry and intrinsic branches of ...