MoonSplat: Monocular Online Gaussian Splatting with Sim(3) Global Optimization
Pith reviewed 2026-06-27 01:33 UTC · model grok-4.3
The pith
Integrating Sim(3) global optimization with voxelized 3D Gaussian Splatting enables robust monocular online 3D reconstruction.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By integrating global Sim(3) optimization into a voxelized 3DGS pipeline and adding color residual learning, the method delivers state-of-the-art accuracy in camera poses and rendering quality on diverse datasets while running in real time, and supports practical deployment on UAV systems.
What carries the argument
Global Sim(3) optimization applied jointly to camera poses and voxelized 3D Gaussian Splatting, which performs scale-aware loop closure and pose correction.
If this is right
- Camera pose estimation becomes more accurate and drift-free over long sequences.
- Loop closure can be performed efficiently on both poses and the scene model.
- Optimization of the 3DGS representation converges faster with better final quality.
- Real-time performance is maintained even for indoor and outdoor scenes.
- Practical systems like UAV active reconstruction become feasible.
Where Pith is reading between the lines
- The method could be adapted to correct scale ambiguities in other monocular SLAM approaches.
- Extending the color residual strategy might benefit other radiance field methods.
- Integration with active sensing could lead to more efficient exploration strategies in robotics.
- Performance on very long sequences would test the stability of the global optimization.
Load-bearing premise
Global Sim(3) optimization reliably fixes errors from monocular pose estimation without introducing instabilities or additional drift.
What would settle it
Running the method on a long monocular sequence with known ground-truth poses and observing that pose error after loop closure exceeds that of standard methods or that rendering quality falls short.
Figures
read the original abstract
Online 3D reconstruction from monocular image sequences is a challenging and ongoing research topic. 3D Gaussian Splatting (3DGS), leveraging its high-quality real-time rendering capability, empowers online 3D reconstruction to represent dense scenes with enhanced expressiveness, and thus holds great promise for a wide range of applications such as robotics and AR/VR. However, existing online 3DGS methods still suffer from some key challenges: fragile camera pose estimation due to the lack of global optimization, and low optimization efficiency in large-scale or long-sequence scenarios. To address these issues, we propose a robust and efficient online voxelized 3DGS reconstruction framework integrated with global $\text{Sim}(3)$ optimization, which enables reliable camera tracking and efficient global loop closure for both camera poses and voxelized 3DGS. To accelerate the convergence of the voxelized 3DGS, we further introduce a color residual learning strategy, which not only boosts optimization speed but also enhances rendering quality. Extensive experiments on diverse indoor and outdoor datasets demonstrate that our method achieves state-of-the-art performance in both camera pose estimation accuracy and rendering quality, while retaining real-time efficiency. Additionally, we develop and deploy a real-world UAV-based active reconstruction system grounded on our proposed method, validating its robustness and generalizability for practical online 3D reconstruction tasks. Our code and data are available at https://github.com/TrickyGo/MoonSplat.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper presents MoonSplat, an online monocular 3D reconstruction framework that combines voxelized 3D Gaussian Splatting with global Sim(3) optimization over both camera poses and the scene representation. It introduces a color residual learning strategy to accelerate convergence and improve rendering. The central claims are that this addresses fragile pose estimation and inefficiency in prior online 3DGS methods, achieves state-of-the-art accuracy in camera tracking and novel-view synthesis on indoor/outdoor datasets while running in real time, and has been deployed on a UAV platform.
Significance. If the empirical results hold, the work would be a meaningful incremental advance for online dense reconstruction in robotics and AR/VR by showing that a global Sim(3) layer can be integrated into voxelized 3DGS without sacrificing real-time performance. The open-sourced code and data are a clear strength for reproducibility.
major comments (2)
- [§4.3] §4.3 (global Sim(3) optimization): the formulation is presented as correcting monocular drift via loop closure, yet the manuscript provides no explicit analysis or ablation showing that the Sim(3) updates do not re-introduce scale or rotational drift on sequences longer than those in the reported tables; this directly bears on the central claim of reliable long-sequence tracking.
- [Table 4] Table 4 (quantitative comparisons): the reported ATE and PSNR gains are load-bearing for the SOTA claim, but the table does not include per-sequence standard deviations or statistical significance tests across the 10+ runs mentioned in the text, making it impossible to judge whether the improvements over the closest baseline are robust.
minor comments (2)
- [§3.4] The color residual learning module is described only at a high level; a short pseudocode block or explicit loss equation would clarify how the residual is added to the standard 3DGS color optimization.
- [Figure 5] Figure 5 (qualitative results) uses inconsistent camera trajectories across rows, making visual comparison of loop-closure behavior difficult.
Simulated Author's Rebuttal
We thank the referee for the constructive comments. We address each major comment below and indicate the revisions we will make.
read point-by-point responses
-
Referee: [§4.3] §4.3 (global Sim(3) optimization): the formulation is presented as correcting monocular drift via loop closure, yet the manuscript provides no explicit analysis or ablation showing that the Sim(3) updates do not re-introduce scale or rotational drift on sequences longer than those in the reported tables; this directly bears on the central claim of reliable long-sequence tracking.
Authors: We appreciate the referee's point. The global Sim(3) optimization is formulated to jointly refine all camera poses and the voxel map under a single similarity transform per loop closure, which is intended to eliminate accumulated monocular drift without re-introducing scale or rotation inconsistencies. Nevertheless, we acknowledge that an explicit ablation on sequences longer than those already reported would provide stronger evidence for the claim. In the revision we will add such an analysis, including results on extended trajectories to verify that scale and rotational drift remain controlled after Sim(3) updates. revision: yes
-
Referee: [Table 4] Table 4 (quantitative comparisons): the reported ATE and PSNR gains are load-bearing for the SOTA claim, but the table does not include per-sequence standard deviations or statistical significance tests across the 10+ runs mentioned in the text, making it impossible to judge whether the improvements over the closest baseline are robust.
Authors: We thank the referee for this observation. The text states that results are averaged over 10+ runs, yet Table 4 reports only mean values. We will update the table to include per-sequence standard deviations. We will also add a brief statistical comparison (e.g., paired t-test p-values or confidence intervals) to quantify the robustness of the reported gains over the closest baseline. revision: yes
Circularity Check
No significant circularity detected
full rationale
The abstract and available text describe a proposed framework (voxelized 3DGS + global Sim(3) optimization + color residual learning) whose central claims rest on empirical SOTA results across datasets and a real-world UAV deployment. No equations, derivation steps, fitted parameters renamed as predictions, or self-citation chains appear in the provided material. The method is presented as an engineering integration whose correctness is asserted via external validation rather than by construction from its own inputs. No load-bearing step reduces to self-definition or renaming of known results.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=
Scalability in perception for autonomous driving: Waymo open dataset , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=
-
[2]
Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=
Segs-slam: Structure-enhanced 3d gaussian splatting slam with appearance embedding , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=
-
[3]
IEEE Transactions on Robotics , volume=
Hi-slam2: Geometry-aware gaussian slam for fast monocular scene reconstruction , author=. IEEE Transactions on Robotics , volume=. 2025 , publisher=
2025
-
[4]
Quarterly of applied mathematics , volume=
A method for the solution of certain non-linear problems in least squares , author=. Quarterly of applied mathematics , volume=
-
[5]
Proceedings of the IEEE international conference on computer vision , pages=
To aggregate or not to aggregate: Selective match kernels for image search , author=. Proceedings of the IEEE international conference on computer vision , pages=
-
[6]
2025 International Conference on 3D Vision (3DV) , pages=
Mast3r-sfm: a fully-integrated solution for unconstrained structure-from-motion , author=. 2025 International Conference on 3D Vision (3DV) , pages=. 2025 , organization=
2025
-
[7]
IEEE transactions on Systems Science and Cybernetics , volume=
A formal basis for the heuristic determination of minimum cost paths , author=. IEEE transactions on Systems Science and Cybernetics , volume=. 1968 , publisher=
1968
-
[8]
2025 International Conference on 3D Vision (3DV) , pages=
Loopsplat: Loop closure by registering 3d gaussian splats , author=. 2025 International Conference on 3D Vision (3DV) , pages=. 2025 , organization=
2025
-
[9]
Wang, Yifan and Zhou, Jianjun and Zhu, Haoyi and Chang, Wenzheng and Zhou, Yang and Li, Zizun and Chen, Junyi and Pang, Jiangmiao and Shen, Chunhua and He, Tong , journal=. pi\^
-
[10]
arXiv preprint arXiv:2511.10647 , year=
Depth anything 3: Recovering the visual space from any views , author=. arXiv preprint arXiv:2511.10647 , year=
-
[11]
ACM Transactions on Graphics (TOG) , volume=
Anysplat: Feed-forward 3d gaussian splatting from unconstrained views , author=. ACM Transactions on Graphics (TOG) , volume=. 2025 , publisher=
2025
-
[12]
European Conference on Computer Vision , pages=
Geocalib: Learning single-image calibration with geometric optimization , author=. European Conference on Computer Vision , pages=. 2024 , organization=
2024
-
[13]
arXiv preprint arXiv:2507.16443 , year=
VGGT-Long: Chunk it, Loop it, Align it--Pushing VGGT's Limits on Kilometer-scale Long RGB Sequences , author=. arXiv preprint arXiv:2507.16443 , year=
-
[14]
Communications of the ACM , volume=
Nerf: Representing scenes as neural radiance fields for view synthesis , author=. Communications of the ACM , volume=. 2021 , publisher=
2021
-
[15]
IEEE transactions on pattern analysis and machine intelligence , volume=
MonoSLAM: Real-time single camera SLAM , author=. IEEE transactions on pattern analysis and machine intelligence , volume=. 2007 , publisher=
2007
-
[16]
IEEE transactions on pattern analysis and machine intelligence , volume=
An eigendecomposition approach to weighted graph matching problems , author=. IEEE transactions on pattern analysis and machine intelligence , volume=. 2002 , publisher=
2002
-
[17]
Proceedings of the IEEE conference on computer vision and pattern recognition , pages=
Structure-from-motion revisited , author=. Proceedings of the IEEE conference on computer vision and pattern recognition , pages=
-
[18]
The International Journal of Robotics Research , volume=
Active vision in robotic systems: A survey of recent developments , author=. The International Journal of Robotics Research , volume=. 2011 , publisher=
2011
-
[19]
IEEE Transactions on Robotics , volume=
Fast-lio2: Fast direct lidar-inertial odometry , author=. IEEE Transactions on Robotics , volume=. 2022 , publisher=
2022
-
[20]
European Conference on Computer Vision , pages=
Learning and aggregating deep local descriptors for instance-level recognition , author=. European Conference on Computer Vision , pages=. 2020 , organization=
2020
-
[21]
Proceedings of the SIGGRAPH Asia 2025 Conference Papers , pages=
Gigaslam: Large-scale monocular slam with hierarchical gaussian splats , author=. Proceedings of the SIGGRAPH Asia 2025 Conference Papers , pages=
2025
-
[22]
Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=
Outdoor monocular slam with global scale-consistent 3d gaussian pointmaps , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=
-
[23]
ACM Transactions on Graphics (ToG) , volume=
Tanks and temples: Benchmarking large-scale scene reconstruction , author=. ACM Transactions on Graphics (ToG) , volume=. 2017 , publisher=
2017
-
[24]
Proceedings of the Computer Vision and Pattern Recognition Conference , pages=
Vggt: Visual geometry grounded transformer , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=
-
[25]
Proceedings of the Computer Vision and Pattern Recognition Conference , pages=
MASt3R-SLAM: Real-time dense SLAM with 3D reconstruction priors , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=
-
[26]
arXiv preprint arXiv:2505.12549 , year=
Vggt-slam: Dense rgb slam optimized on the sl (4) manifold , author=. arXiv preprint arXiv:2505.12549 , year=
-
[27]
arXiv preprint arXiv:2510.08551 , year=
Artdeco: Towards efficient and high-fidelity on-the-fly 3d reconstruction with structured scene representation , author=. arXiv preprint arXiv:2510.08551 , year=
-
[28]
Proceedings of the IEEE/CVF International Conference on Computer Vision , pages =
Jonathan T Barron and Ben Mildenhall and Matthew Tancik and Peter Hedman and Ricardo Martin-Brualla and Pratul P Srinivasan , title =. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages =
-
[29]
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages =
Jonathan T Barron and Ben Mildenhall and Dor Verbin and Pratul P Srinivasan and Peter Hedman , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages =
-
[30]
Proceedings of the IEEE/CVF International Conference on Computer Vision , pages =
Jonathan T Barron and Ben Mildenhall and Dor Verbin and Pratul P Srinivasan and Peter Hedman , title =. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages =
-
[31]
CVPR , year =
David Charatan and Sizhe Li and Andrea Tagliasacchi and Vincent Sitzmann , title =. CVPR , year =
-
[32]
The Thirteenth International Conference on Learning Representations , year =
Yihang Chen and Qianyi Wu and Mengyao Li and Weiyao Lin and Mehrtash Harandi and Jianfei Cai , title =. The Thirteenth International Conference on Learning Representations , year =
-
[33]
2024 , eprint =
Yuedong Chen and Haofei Xu and Chuanxia Zheng and Bohan Zhuang and Marc Pollefeys and Andreas Geiger and TatJen Cham and Jianfei Cai , title =. 2024 , eprint =
2024
-
[34]
Ecology , volume =
Philip J Clark and Francis C Evans , title =. Ecology , volume =
-
[35]
Chang and Manolis Savva and Maciej Halber and Thomas Funkhouser and Matthias Nie
Angela Dai and Angel X. Chang and Manolis Savva and Maciej Halber and Thomas Funkhouser and Matthias Nie. Scannet: Richly-annotated 3d reconstructions of indoor scenes , booktitle =. 2017 , publisher =
2017
-
[36]
2024 , eprint =
Tianchen Deng and Yaohui Chen and Leyan Zhang and Jianfei Yang and Shenghai Yuan and Danwei Wang and Weidong Chen , title =. 2024 , eprint =
2024
-
[37]
IEEE transactions on pattern analysis and machine intelligence , volume =
Jakob Engel and Vladlen Koltun and Daniel Cremers , title =. IEEE transactions on pattern analysis and machine intelligence , volume =
-
[38]
Advances in neural information processing systems , volume =
Zhiwen Fan and Kevin Wang and Kairun Wen and Zehao Zhu and Dejia Xu and Zhangyang Wang and others , title =. Advances in neural information processing systems , volume =
-
[39]
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages =
Yang Fu and Sifei Liu and Amey Kulkarni and Jan Kautz and Alexei A Efros and Xiaolong Wang , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages =
-
[40]
The international journal of robotics research , volume =
Andreas Geiger and Philip Lenz and Christoph Stiller and Raquel Urtasun , title =. The international journal of robotics research , volume =
-
[41]
Milo: Mesh-in-the-loop gaussian splatting for detailed and efficient surface reconstruction , journal =
Antoine Gu. Milo: Mesh-in-the-loop gaussian splatting for detailed and efficient surface reconstruction , journal =. 2025 , url =
2025
-
[42]
European Conference on Computer Vision , year =
Seongbo Ha and Jiung Yeon and Hyeonwoo Yu , title =. European Conference on Computer Vision , year =
-
[43]
Robert M Haralock and Linda G Shapiro , title =
-
[44]
European Conference on Computer Vision , year =
Jiarui Hu and Xianhao Chen and Boyin Feng and Guanglin Li and Liangjing Yang and Hujun Bao and Guofeng Zhang and Zhaopeng Cui , title =. European Conference on Computer Vision , year =
-
[45]
SIGGRAPH 2024 Conference Papers , year =
Binbin Huang and Zehao Yu and Anpei Chen and Andreas Geiger and Shenghua Gao , title =. SIGGRAPH 2024 Conference Papers , year =
2024
-
[46]
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , year =
Huajian Huang and Longwei Li and Hui Cheng and Sai-Kit Yeung , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , year =
-
[47]
Hanwen Jiang and Hao Tan and Peng Wang and Haian Jin and Yue Zhao and Sai Bi and Kai Zhang and Fujun Luan and Kalyan Sunkavalli and Qixing Huang and Georgios Pavlakos , title =
-
[48]
2025 , eprint =
Lihan Jiang and Yucheng Mao and Linning Xu and Tao Lu and Kerui Ren and Yichen Jin and Xudong Xu and Mulin Yu and Jiangmiao Pang and Feng Zhao and others , title =. 2025 , eprint =
2025
-
[49]
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , pages =
Lihan Jiang and Kerui Ren and Mulin Yu and Linning Xu and Junting Dong and Tao Lu and Feng Zhao and Dahua Lin and Bo Dai , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , pages =
-
[50]
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , year =
Nikhil Keetha and Jay Karhade and Krishna Murthy Jatavallabhula and Gengshan Yang and Sebastian Scherer and Deva Ramanan and Jonathon Luiten , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , year =
-
[51]
3d gaussian splatting for real-time radiance field rendering , journal =
Bernhard Kerbl and Georgios Kopanas and Thomas Leimk. 3d gaussian splatting for real-time radiance field rendering , journal =
-
[52]
ACM Transactions on Graphics , volume =
Bernhard Kerbl and Andreas Meuleman and Georgios Kopanas and Michael Wimmer and Alexandre Lanvin and George Drettakis , title =. ACM Transactions on Graphics , volume =. 2024 , url =
2024
-
[53]
Vincent Leroy and Yohann Cabon and Jerome Revaud , title =
-
[54]
IEEE Robotics and Automation Letters , year =
Guanghao Li and Yu Cao and Qi Chen and Xin Gao and Yifan Yang and Jian Pu , title =. IEEE Robotics and Automation Letters , year =
-
[55]
IEEE Transactions on Artificial Intelligence , year =
Guanghao Li and Qi Chen and Sijia Hu and Yuxiang Yan and Jian Pu , title =. IEEE Transactions on Artificial Intelligence , year =
-
[56]
Pattern Recognition , volume =
Guanghao Li and Qi Chen and Yuxiang Yan and Jian Pu , title =. Pattern Recognition , volume =
-
[57]
Proceedings of the IEEE/CVF International Conference on Computer Vision , pages =
Yixuan Li and Lihan Jiang and Linning Xu and Yuanbo Xiangli and Zhenzhi Wang and Dahua Lin and Bo Dai , title =. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages =
-
[58]
Proceedings of the IEEE/CVF International Conference on Computer Vision , pages =
Chen-Hsuan Lin and Wei-Chiu Ma and Antonio Torralba and Simon Lucey , title =. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages =
-
[59]
2025 , eprint =
Chin-Yang Lin and Cheng Sun and Fu-En Yang and Min-Hung Chen and Yen-Yu Lin and Yu-Lun Liu , title =. 2025 , eprint =
2025
-
[60]
2025 , month =
Chin-Yang Lin and Cheng Sun and Fu-En Yang and Min-Hung Chen and Yen-Yu Lin and Yu-Lun Liu , title =. 2025 , month =
2025
-
[61]
European Conference on Computer Vision , pages =
Lahav Lipson and Zachary Teed and Jia Deng , title =. European Conference on Computer Vision , pages =
-
[62]
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , year =
Hidenobu Matsuki and Riku Murai and Paul HJ Kelly and Andrew J Davison , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , year =
-
[63]
ACM Transactions on Graphics (TOG) , volume =
Andreas Meuleman and Ishaan Shah and Alexandre Lanvin and Bernhard Kerbl and George Drettakis , title =. ACM Transactions on Graphics (TOG) , volume =
-
[64]
Communications of the ACM , volume =
Ben Mildenhall and Pratul P Srinivasan and Matthew Tancik and Jonathan T Barron and Ravi Ramamoorthi and Ren Ng , title =. Communications of the ACM , volume =
-
[65]
Instant neural graphics primitives with a multiresolution hash encoding , journal =
Thomas M. Instant neural graphics primitives with a multiresolution hash encoding , journal =
-
[66]
ORB-SLAM2: an open-source SLAM system for monocular, stereo and RGB-D cameras , journal =
Ra. ORB-SLAM2: an open-source SLAM system for monocular, stereo and RGB-D cameras , journal =. 2017 , doi =
2017
-
[67]
Davison , title =
Riku Murai and Eric Dexheimer and Andrew J. Davison , title =. 2024 , eprint =
2024
-
[68]
Global structure-from-motion revisited , booktitle =
Linfei Pan and D. Global structure-from-motion revisited , booktitle =
-
[69]
2024 , eprint =
Chensheng Peng and Chenfeng Xu and Yue Wang and Mingyu Ding and Heng Yang and Masayoshi Tomizuka and Kurt Keutzer and Marco Pavone and Wei Zhan , title =. 2024 , eprint =
2024
-
[70]
ACM SIGGRAPH 2024 Conference Papers , pages =
Zhexi Peng and Tianjia Shao and Yong Liu and Jingke Zhou and Yin Yang and Jingdong Wang and Kun Zhou , title =. ACM SIGGRAPH 2024 Conference Papers , pages =
2024
-
[71]
2024 , eprint =
Kerui Ren and Lihan Jiang and Tao Lu and Mulin Yu and Linning Xu and Zhangkai Ni and Bo Dai , title =. 2024 , eprint =
2024
-
[72]
Splat-slam: Globally optimized rgb-only slam with 3d gaussians , booktitle =
Erik Sandstr. Splat-slam: Globally optimized rgb-only slam with 3d gaussians , booktitle =
-
[73]
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , year =
Johannes L Schonberger and Jan-Michael Frahm , title =. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , year =
-
[74]
Pixelwise View Selection for Unstructured Multi-View Stereo , booktitle =
Johannes Lutz Sch. Pixelwise View Selection for Unstructured Multi-View Stereo , booktitle =
-
[75]
A benchmark for the evaluation of rgb-d slam systems , booktitle =
J. A benchmark for the evaluation of rgb-d slam systems , booktitle =
-
[76]
Proceedings of the IEEE/CVF international conference on computer vision , pages =
Edgar Sucar and Shikun Liu and Joseph Ortiz and Andrew J Davison , title =. Proceedings of the IEEE/CVF international conference on computer vision , pages =
-
[77]
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , pages =
Cheng Sun and Min Sun and Hwann-Tzong Chen , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , pages =
-
[78]
Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages =
Pei Sun and Henrik Kretzschmar and Xerxes Dotiwalla and Aurelien Chouard and Vijaysai Patnaik and Paul Tsui and James Guo and Yin Zhou and Yuning Chai and Benjamin Caine and others , title =. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages =
-
[79]
2024 , eprint =
Zhenggang Tang and Yuchen Fan and Dilin Wang and Hongyu Xu and Rakesh Ranjan and Alexander Schwing and Zhicheng Yan , title =. 2024 , eprint =
2024
-
[80]
Advances in neural information processing systems , volume =
Zachary Teed and Jia Deng , title =. Advances in neural information processing systems , volume =
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.