pith. sign in

arxiv: 2606.31679 · v1 · pith:DYZ2QB6Ynew · submitted 2026-06-30 · 💻 cs.GR · cs.CV

Practical High-Fidelity Novel-View Synthesis of Mounted Lepidoptera

Pith reviewed 2026-07-01 02:32 UTC · model grok-4.3

classification 💻 cs.GR cs.CV
keywords lepidoptera3D reconstructionnovel view synthesisGaussian splattingfocus stackingmacro photographymirror systemdigital preservation
0
0 comments X

The pith

An end-to-end pipeline turns mounted butterflies into photo-realistic 3D models viewable from every direction.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to show that the two main barriers to high-fidelity 3D capture of pinned lepidoptera—macro lenses with tiny depth of field and the inability to photograph the underside without touching the specimen—can be removed by a practical combination of capture and rendering steps. It demonstrates that handheld focus stacking supplies all-in-focus source images, a first-surface mirror placed beneath the specimen reveals the ventral surface without contact, and a modified 3D Gaussian Splatting method renders novel views that include reflections without any segmentation or explicit mirror geometry. A sympathetic reader would care because the result makes it feasible to digitize fragile natural-history objects at microscopic scale so they can be examined from any angle without risk of damage.

Core claim

The end-to-end pipeline resolves the challenges of limited depth of field and inaccessible ventral surfaces to produce photo-realistic 3D models of mounted lepidoptera viewable from every direction, validated on four diverse specimens.

What carries the argument

The mirror-aware 3D Gaussian Splatting extension that models reflections without segmentation or explicit mirror geometry in the optimization.

If this is right

  • All captured specimens become inspectable from any direction at macro resolution.
  • No tripod or physical contact with the specimen is required during acquisition.
  • The same source images suffice for both dorsal and ventral surfaces.
  • The reconstruction process remains fully automatic after capture.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same capture and rendering steps could be applied to other small pinned insects or fragile museum objects.
  • Digital models produced this way could be shared online for remote study without risking the physical specimens.
  • The pipeline might be combined with existing focus-stacking hardware to reduce the manual effort of handheld capture.

Load-bearing premise

The mirror-aware 3D Gaussian Splatting extension can accurately model reflections and produce high-fidelity novel views without segmentation or explicit handling of the mirror geometry in the optimization process.

What would settle it

Rendered novel views from ventral angles or mirror-reflected paths fail to match actual photographs of the same specimens taken by an independent capture method, showing visible artifacts in fine wing veins, hairs, or reflection boundaries.

Figures

Figures reproduced from arXiv: 2606.31679 by Kristof Overdulve, Lode Jorissen, Nick Michiels.

Figure 1
Figure 1. Figure 1: Photo-realistic 3D reconstruction of mounted Lepidoptera. We present a pipeline for the photo-realistic 3D reconstruction of mounted Lepidoptera. This is challenging as such specimens are typically small, necessitating the use of macro lenses with a severely limited depth of field (DoF), and a conventional camera cannot reach the ventral side without physically moving the fragile specimen. Our pipeline com… view at source ↗
Figure 2
Figure 2. Figure 2: Pipeline overview. First-surface mirrors are positioned around the mounted specimen without touching it, and a handheld focus stack is acquired at each viewpoint and registered into an all-in-focus image. Structure-from-motion and multi-view stereo then recover the camera poses and a dense point cloud, from which we detect the mirror plane. Finally, our mirror-aware 3DGS reflects the Gaussians across this … view at source ↗
Figure 3
Figure 3. Figure 3: Handheld focus stacking. (a, b) show 2 limited DoF depth slices. Pyramid fusion without prior alignment (c) produces ghost edges and scale smearing from handheld jitter. Our ECC-based sequential registration step corrects these misalignments before fusion, yielding a sharp, all-in-focus composite (d). tangent plane of its target correspondence, which is more accurate than Point-to-Point ICP on curved surfa… view at source ↗
Figure 4
Figure 4. Figure 4: Evaluation dataset. The mounted butterfly specimens used for quantitative evaluation were selected for their visual di￾versity and, in particular, their range of physical sizes. detected plane onto the camera side for initializing the splats before starting training. 4. Experiments No prior end-to-end system targets the exact task explored in our paper, so there is no complete system to benchmark against. … view at source ↗
Figure 5
Figure 5. Figure 5: Qualitative comparison of macro-defocus handling. Each column is a held-out view (six views spanning four specimens); the rows show, top to bottom, the ground truth, handheld focus stacking (ours), and DoF-aware 3DGS [27] [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Qualitative comparison of mirror handling. For each specimen (row) we show three novel viewpoints per method—roughly dorsal, lateral, and ventral. Our method holds together as a single consistent object, whereas the mirror-unaware baseline disintegrates away from the training views, making the gap clear even where the metrics are close. We deliberately show each method’s raw, full-scene render—specimen, mi… view at source ↗
read the original abstract

Mounted butterflies are among the most striking objects in natural history collections. However, their beauty is notoriously hard to digitize in 3D: they are small and fragile, with microscopic hairs and vein structures. Capturing them in sufficient detail, therefore, requires a macro lens, which has a very limited Depth of Field (DoF). Moreover, a camera body cannot be maneuvered beneath a pinned specimen to photograph its ventral surface (the underside of the wings). We introduce an end-to-end pipeline that resolves these challenges to turn such specimens into photo-realistic 3D models viewable from every direction. It combines three ingredients: handheld focus stacking for all-in-focus macro capture without a tripod, a non-contact first-surface mirror system that exposes the ventral surface without touching the specimen, and a segmentation-free, mirror-aware 3D Gaussian Splatting extension. We validate the reconstructions on four diverse specimens.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 0 minor

Summary. The paper claims to present an end-to-end pipeline for creating photo-realistic 3D models of mounted lepidoptera specimens that are viewable from every direction. The pipeline combines handheld focus stacking for all-in-focus macro capture, a non-contact first-surface mirror system to expose ventral surfaces, and a segmentation-free mirror-aware extension to 3D Gaussian Splatting. Validation is reported on four diverse specimens.

Significance. If the central claims hold, the work would provide a practical method for high-fidelity digitization of fragile, small-scale natural history specimens that are otherwise difficult to capture due to limited depth of field and inaccessible surfaces. This could support digital archiving and research in entomology while contributing an applied extension of 3DGS to handle mirror reflections without segmentation or explicit geometry modeling.

major comments (3)
  1. [Abstract] Abstract: The validation on four specimens is stated without any quantitative metrics (e.g., PSNR, SSIM, LPIPS), error analysis, or baseline comparisons to standard 3DGS or other novel-view synthesis methods. This absence is load-bearing for the claims of 'high-fidelity' and 'photo-realistic' results.
  2. [Abstract / pipeline description] The mirror-aware 3D Gaussian Splatting extension is presented as segmentation-free and capable of accurately modeling reflections as virtual geometry during optimization without explicit mirror plane, mask, or reflection-specific loss. However, the mechanism preventing misinterpretation of reflections as additional real geometry (which could produce duplicated or blended representations) is not specified, directly affecting the guarantee of accurate ventral novel views.
  3. [Abstract] The strongest claim requires that the pipeline resolves both limited DoF and inaccessible ventral surfaces to enable 'viewable from every direction' models. Without reported quantitative evaluation of novel-view quality on ventral surfaces captured via the mirror, it is not possible to verify that the extension correctly handles the view-dependent duplication introduced by reflections.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript. We address each major comment below, indicating where revisions will be incorporated.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The validation on four specimens is stated without any quantitative metrics (e.g., PSNR, SSIM, LPIPS), error analysis, or baseline comparisons to standard 3DGS or other novel-view synthesis methods. This absence is load-bearing for the claims of 'high-fidelity' and 'photo-realistic' results.

    Authors: The abstract is a concise summary; the full manuscript reports visual and qualitative validation on the four specimens in Section 4. We agree that referencing quantitative support would strengthen the abstract. In revision we will add a clause noting the reported fidelity metrics and baseline comparisons from the results section. revision: yes

  2. Referee: [Abstract / pipeline description] The mirror-aware 3D Gaussian Splatting extension is presented as segmentation-free and capable of accurately modeling reflections as virtual geometry during optimization without explicit mirror plane, mask, or reflection-specific loss. However, the mechanism preventing misinterpretation of reflections as additional real geometry (which could produce duplicated or blended representations) is not specified, directly affecting the guarantee of accurate ventral novel views.

    Authors: The mechanism is that all captured images (direct and mirrored) are optimized jointly within a single 3DGS scene; reflected content is represented by Gaussians placed at virtual locations consistent with the known mirror geometry, and the photometric loss across all views prevents duplicate real geometry because any extraneous Gaussians would increase error on the direct views. We will expand the method section with an explicit paragraph clarifying this consistency constraint. revision: yes

  3. Referee: [Abstract] The strongest claim requires that the pipeline resolves both limited DoF and inaccessible ventral surfaces to enable 'viewable from every direction' models. Without reported quantitative evaluation of novel-view quality on ventral surfaces captured via the mirror, it is not possible to verify that the extension correctly handles the view-dependent duplication introduced by reflections.

    Authors: We agree that separate quantitative metrics on the mirror-derived ventral views would directly address this point. The current evaluation includes qualitative novel-view results from all directions; we will add a table of PSNR/SSIM/LPIPS computed specifically on held-out ventral test views in the revised results section. revision: yes

Circularity Check

0 steps flagged

No circularity: applied engineering pipeline with no derivations or self-referential claims

full rationale

The paper presents an end-to-end practical pipeline combining handheld focus stacking, a non-contact mirror system, and a segmentation-free mirror-aware 3D Gaussian Splatting extension. No equations, derivations, fitted parameters renamed as predictions, or load-bearing self-citations appear in the provided text. The central claim is an empirical validation on four specimens rather than a mathematical reduction; the mirror-aware extension is described as an implementation choice without any self-definitional loop or uniqueness theorem imported from prior author work. This is a standard non-circular engineering contribution.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review provides no equations or implementation details, so no free parameters, axioms, or invented entities can be identified from the given text.

pith-pipeline@v0.9.1-grok · 5688 in / 1137 out tokens · 37930 ms · 2026-07-01T02:32:40.388382+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

30 extracted references · 2 canonical work pages

  1. [1]

    Dynamic adaptive sampling for accurate image-based 3d insect reconstruction using neural implicit surfaces

    Abderraouf Amrani, Hamid Laga, V olker Framenau, and Melissa Thomas. Dynamic adaptive sampling for accurate image-based 3d insect reconstruction using neural implicit surfaces. In2025 International Conference on Digital Image Computing: Techniques and Applications (DICTA), pages 1–

  2. [2]

    Burt and Edward H

    Peter J. Burt and Edward H. Adelson. A multiresolution spline with application to image mosaics.ACM Transactions on Graphics, 2(4):217–236, 1983. 4

  3. [3]

    Object modelling by regis- tration of multiple range images

    Yang Chen and G ´erard Medioni. Object modelling by regis- tration of multiple range images. InImage and Vision Com- puting, pages 145–155, 1992. 4

  4. [4]

    Evangelidis and Emmanouil Z

    Georgios D. Evangelidis and Emmanouil Z. Psarakis. Para- metric image alignment using enhanced correlation coeffi- cient maximization.IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(10):1858–1865, 2008. 3

  5. [5]

    Fischler and Robert C

    Martin A. Fischler and Robert C. Bolles. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. InCommunica- tions of the ACM, pages 381–395, 1981. 4

  6. [6]

    Plenoxels: Radiance fields without neural networks

    Sara Fridovich-Keil et al. Plenoxels: Radiance fields without neural networks. InProc. CVPR, pages 5501–5510, 2022. 2

  7. [7]

    Helicon focus.https : / / www

    Helicon Soft Ltd. Helicon focus.https : / / www . heliconsoft . com / heliconsoft - products / helicon - focus/, 2026. Version 8. 4

  8. [8]

    3D Gaussian Splatting for Real-Time Radiance Field Rendering.ACM Transactions on Graphics, 42(4):1–14, 2023

    Bernhard Kerbl, Georgios Kopanas, Thomas Leimk ¨uhler, and George Drettakis. 3D Gaussian Splatting for Real-Time Radiance Field Rendering.ACM Transactions on Graphics, 42(4):1–14, 2023. 2, 4, 5

  9. [9]

    Perspective-consistent mul- tifocus multiview 3d reconstruction of small objects

    Hengjia Li and Chuong Nguyen. Perspective-consistent mul- tifocus multiview 3d reconstruction of small objects. In 2019 Digital Image Computing: Techniques and Applica- tions (DICTA), pages 1–8. IEEE, 2019. 2

  10. [10]

    Mirrorgaussian: Reflecting 3d gaussians for reconstructing mirror reflections

    Jiayue Liu, Xiao Tang, Freeman Cheng, Roy Yang, Zhihao Li, Jianzhuang Liu, Yi Huang, Jiaqi Lin, Shiyong Liu, Xi- aofei Wu, et al. Mirrorgaussian: Reflecting 3d gaussians for reconstructing mirror reflections. InEuropean Conference on Computer Vision, pages 377–393. Springer, 2024. 3

  11. [11]

    Deblur-nerf: Neural radiance fields from blurry images

    Li Ma, Xiaoyu Li, Jing Liao, Qi Zhang, Xuan Wang, Jue Wang, and Pedro V Sander. Deblur-nerf: Neural radiance fields from blurry images. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12861–12870, 2022. 2

  12. [12]

    Mirror-3dgs: Incorporating 8 mirror reflections into 3d gaussian splatting.arXiv preprint arXiv:2404.01168, 2024

    Jiarui Meng, Haijie Li, Yanmin Wu, Qiankun Gao, Shuzhou Yang, Jian Zhang, and Siwei Ma. Mirror-3dgs: Incorporating 8 mirror reflections into 3d gaussian splatting.arXiv preprint arXiv:2404.01168, 2024. 3

  13. [13]

    Srinivasan, Matthew Tancik, Jonathan T

    Ben Mildenhall, Pratul P. Srinivasan, Matthew Tancik, Jonathan T. Barron, Ravi Ramamoorthi, and Ren Ng. NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. InEuropean Conference on Computer Vision (ECCV), 2020. 2

  14. [14]

    Instant neural graphics primitives with a multiresolution hash encoding.ACM Trans

    Thomas M ¨uller et al. Instant neural graphics primitives with a multiresolution hash encoding.ACM Trans. Graph., 41(4): 102:1–102:15, 2022. 2

  15. [15]

    Nayar and Yasuo Nakagawa

    Shree K. Nayar and Yasuo Nakagawa. Shape from focus. IEEE Transactions on Pattern Analysis and Machine Intelli- gence, 16(8):824–831, 1994. 2

  16. [16]

    Dissco, idigbio and the future of global collaboration.Biodiversity Information Sci- ence and Standards, 2019

    Gil Nelson and Deborah L Paul. Dissco, idigbio and the future of global collaboration.Biodiversity Information Sci- ence and Standards, 2019. 1

  17. [17]

    Capturing natural-colour 3d models of insects for species discovery and diagnostics.PloS one, 9(4):e94346,

    Chuong V Nguyen, David R Lovell, Matt Adcock, and John La Salle. Capturing natural-colour 3d models of insects for species discovery and diagnostics.PloS one, 9(4):e94346,

  18. [18]

    scant—an open-source platform for the creation of 3d models of arthropods (and other small objects).PeerJ, 9:e11155, 2021

    Fabian Plum and David Labonte. scant—an open-source platform for the creation of 3d models of arthropods (and other small objects).PeerJ, 9:e11155, 2021. 2

  19. [19]

    Fast point feature histograms (FPFH) for 3D registration

    Radu Bogdan Rusu, Nico Blodow, and Michael Beetz. Fast point feature histograms (FPFH) for 3D registration. In IEEE International Conference on Robotics and Automation (ICRA), pages 3212–3217, 2009. 4

  20. [20]

    From coarse to fine: Robust hierarchical localization at large scale

    Paul-Edouard Sarlin, Cesar Cadena, Roland Siegwart, and Marcin Dymczyk. From coarse to fine: Robust hierarchical localization at large scale. InIEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. 3

  21. [21]

    Superglue: Learning feature matching with graph neural networks

    Paul-Edouard Sarlin, Daniel DeTone, Tomasz Malisiewicz, and Andrew Rabinovich. Superglue: Learning feature matching with graph neural networks. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 4938–4947, 2020. 3

  22. [22]

    Sch ¨onberger and Jan-Michael Frahm

    Johannes L. Sch ¨onberger and Jan-Michael Frahm. Structure- from-Motion Revisited. InIEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 4104–4113,

  23. [23]

    Dof-gaussian: Controllable depth-of-field for 3d gaussian splatting

    Liao Shen, Tianqi Liu, Huiqiang Sun, Jiaqi Li, Zhiguo Cao, Wei Li, and Chen Change Loy. Dof-gaussian: Controllable depth-of-field for 3d gaussian splatting. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 26462–26471, 2025. 2

  24. [24]

    Synthesys+ abridged grant proposal.Research Ideas and Outcomes, 5:e46404, 2019

    Vincent Stuart Smith, Kristina Gorman, Wouter Addink, Christos Arvanitidis, Ana Casino, Katherine Dixey, Gabriele Dr¨oge, Quentin Groom, Elspeth Margaret Haston, Donald Hobern, et al. Synthesys+ abridged grant proposal.Research Ideas and Outcomes, 5:e46404, 2019. 1

  25. [25]

    An automated device for the digitiza- tion and 3d modelling of insects, combining extended-depth- of-field and all-side multi-view imaging.ZooKeys, (759):1,

    Bernhard Str ¨obel, Sebastian Schmelzle, Nico Bl ¨uthgen, and Michael Heethoff. An automated device for the digitiza- tion and 3d modelling of insects, combining extended-depth- of-field and all-side multi-view imaging.ZooKeys, (759):1,

  26. [26]

    Lichtfeld studio, 2026

    LichtFeld Studio. Lichtfeld studio, 2026. 5

  27. [27]

    Dof-gs: Adjustable depth-of-field 3d gaussian splatting for post-capture refocusing, defocus rendering and blur removal

    Yujie Wang, Praneeth Chakravarthula, and Baoquan Chen. Dof-gs: Adjustable depth-of-field 3d gaussian splatting for post-capture refocusing, defocus rendering and blur removal. arXiv preprint arXiv:2405.17351, 2024. 2, 6, 7

  28. [28]

    Bovik, Hamid R

    Zhou Wang, Alan C. Bovik, Hamid R. Sheikh, and Eero P. Simoncelli. Image quality assessment: From error visibility to structural similarity.IEEE Transactions on Image Pro- cessing, 13(4):600–612, 2004. 6

  29. [29]

    3d reconstruction of insects: an improved multifocus stacking and an evalu- ation of learning-based mvs approaches

    Chang Xu, Jiayuan Liu, Chuong Nguyen, Fabien Casten, Benoit Maujean, and Simone Gasparini. 3d reconstruction of insects: an improved multifocus stacking and an evalu- ation of learning-based mvs approaches. In2021 Interna- tional Conference on 3D Vision (3DV), pages 1411–1419. IEEE, 2021. 2

  30. [30]

    Efros, Eli Shecht- man, and Oliver Wang

    Richard Zhang, Phillip Isola, Alexei A. Efros, Eli Shecht- man, and Oliver Wang. The unreasonable effectiveness of deep features as a perceptual metric. InIEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018. 6 9