hub

Open3D: A Modern Library for 3D Data Processing

Qian-Yi Zhou, Jaesik Park, Vladlen Koltun · 2018 · cs.CV · arXiv 1801.09847

46 Pith papers cite this work. Polarity classification is still indexing.

46 Pith papers citing it

open full Pith review browse 46 citing papers arXiv PDF

abstract

Open3D is an open-source library that supports rapid development of software that deals with 3D data. The Open3D frontend exposes a set of carefully selected data structures and algorithms in both C++ and Python. The backend is highly optimized and is set up for parallelization. Open3D was developed from a clean slate with a small and carefully considered set of dependencies. It can be set up on different platforms and compiled from source with minimal effort. The code is clean, consistently styled, and maintained via a clear code review mechanism. Open3D has been used in a number of published research projects and is actively deployed in the cloud. We welcome contributions from the open-source community.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 3 method 1

citation-polarity summary

background 3 use method 1

representative citing papers

Scalable and Differentiable Point-Cloud Registration Using Maximum Mean Discrepancy

cs.CV · 2026-06-26 · unverdicted · novelty 7.0

MMD-Reg registers point clouds without correspondences by minimizing an MMD objective approximated via random Fourier features, solved with Levenberg-Marquardt and differentiated via the implicit function theorem for use as a neural network layer.

CARD: A Multi-Modal Automotive Dataset for Dense 3D Reconstruction in Challenging Road Topography

cs.CV · 2026-05-06 · conditional · novelty 7.0

CARD is a new multi-modal driving dataset delivering ~500K dense depth pixels per frame from challenging road topographies using stereo cameras and fused LiDARs over 110 km.

Manifold k-NN: Accelerated k-NN Queries for Manifold Point Clouds

cs.CG · 2026-05-04 · unverdicted · novelty 7.0

Manifold k-NN generalizes DP-NNS to k-NN queries on manifold point clouds via a recursive successor-list property, delivering 1-10x speedups and full dynamic support.

Paired-CSLiDAR: Height-Stratified Registration for Cross-Source Aerial-Ground LiDAR Pose Refinement

cs.RO · 2026-05-01 · conditional · novelty 7.0

Paired-CSLiDAR benchmark and Residual-Guided Stratified Registration achieve 86% success at 0.75 m RMSE on 9,012 cross-source pairs by height-stratified ICP and confidence-gated selection.

PC2Model: ISPRS benchmark on 3D point cloud to model registration

cs.CV · 2026-04-21 · unverdicted · novelty 7.0

PC2Model is a new public benchmark dataset combining simulated and real-world 3D point clouds with corresponding models to train and test registration methods.

ClipGStream: Clip-Stream Gaussian Splatting for Any Length and Any Motion Multi-View Dynamic Scene Reconstruction

cs.CV · 2026-04-15 · unverdicted · novelty 7.0

ClipGStream enables scalable flicker-free reconstruction of long dynamic multi-view videos by performing stream optimization at the clip level with clip-independent spatio-temporal fields, residual anchor compensation, and inter-clip inherited anchors.

SEM-ROVER: Semantic Voxel-Guided Diffusion for Large-Scale Driving Scene Generation

cs.CV · 2026-04-07 · unverdicted · novelty 7.0

SEM-ROVER generates large multiview-consistent 3D urban driving scenes via semantic-conditioned diffusion on Σ-Voxfield voxel grids with progressive outpainting and deferred rendering.

2D Triangle Splatting for Direct Differentiable Mesh Training

cs.CV · 2025-06-23 · unverdicted · novelty 7.0

2D Triangle Splatting uses 2D triangles instead of 3D Gaussians to enable differentiable optimization that yields opaque mesh-like reconstructions with competitive visual quality.

ViVo: A Dataset for Volumetric Video Reconstruction and Compression

cs.CV · 2025-05-31 · conditional · novelty 7.0

ViVo introduces a diverse multi-view volumetric video dataset with raw multi-camera RGB-depth data, calibration, masks, and point clouds to support reconstruction and compression research, with benchmarks highlighting limitations of current methods.

Curve Skeletonization in Continuous domain for Meshes and Point Clouds

cs.GR · 2026-05-25 · unverdicted · novelty 6.0

CSCD generalizes LS to continuous domain with CSCD-M using intrinsic triangulation for meshes and CSCD-PC using tufted Laplacians for point clouds, claiming to match or outperform priors on benchmarks.

LAPS: Improving Incremental LiDAR Mapping using Active Pooling and Sampling for Neural Distance Fields

cs.RO · 2026-05-15 · unverdicted · novelty 6.0

LAPS improves incremental neural LiDAR mapping by combining reliability-based active pooling for sample retention with uncertainty-guided active sampling for optimization focus.

Towards Virtual Qualification in Nuclear Fusion: Demonstrating Probabilistic Model Validation on a High Heat Flux Component

physics.plasm-ph · 2026-05-12 · conditional · novelty 6.0

A probabilistic validation framework with a novel modified area validation metric quantifies finite element model error for fusion heat sinks while separating it from aleatoric and epistemic experimental uncertainties.

Learning Point Cloud Geometry as a Statistical Manifold: Theory and Practice

cs.RO · 2026-05-11 · unverdicted · novelty 6.0

Point cloud geometry is cast as a statistical manifold of per-point Gaussians, with POLI learning the mapping self-supervisedly to improve perception without labeled data.

Globally adaptive and locally regular point discretization of curved surfaces

cs.CE · 2026-05-05 · unverdicted · novelty 6.0

A gradient-descent algorithm with level-set surface representation and dynamic point adjustment generates curvature-adaptive, locally regular point distributions on curved surfaces with low deviation from target spacing.

Structural MAT: Clean and Scalable Medial Axis Simplification via Explicit Surface Correspondence

cs.GR · 2026-05-04 · unverdicted · novelty 6.0

A new MAT simplification algorithm uses explicit surface correspondence tracking and priority-controlled edge collapses to preserve structural features like fillet alignments on discrete meshes.

PRIME: Protein Representation via Physics-Informed Multiscale Equivariant Hierarchies

cs.LG · 2026-05-02 · unverdicted · novelty 6.0 · 2 refs

PRIME is a five-level hierarchical equivariant graph model for proteins that uses physics-informed deterministic operators to exchange information across scales and achieves state-of-the-art results on fold classification and reaction class prediction.

From Stealthy Data Fabrication to Unsafe Driving: Realistic Scenario Attacks on Collaborative Perception

cs.CR · 2026-05-02 · unverdicted · novelty 6.0

A new online attack framework manipulates object poses in shared CAV perception data below detection thresholds, propagating errors to cause unsafe trajectory predictions and behaviors in up to 50% of tested scenarios while evading defenses.

Leveraging Previous-Traversal Point Cloud Map Priors for Camera-Based 3D Object Detection and Tracking

cs.CV · 2026-04-28 · unverdicted · novelty 6.0

DualViewMapDet fuses prior-traversal point cloud maps into camera features via dual perspective-view and bird's-eye-view encoding to improve 3D detection and tracking without LiDAR.

TouchAnything: Diffusion-Guided 3D Reconstruction from Sparse Robot Touches

cs.CV · 2026-04-10 · unverdicted · novelty 6.0

TouchAnything reconstructs accurate 3D object geometries from only a few tactile contacts by optimizing for consistency with a pretrained visual diffusion prior.

Visually-grounded Humanoid Agents

cs.CV · 2026-04-09 · unverdicted · novelty 6.0

A coupled world-agent framework uses 3D Gaussian reconstruction and first-person RGB-D perception with iterative planning to enable goal-directed, collision-avoiding humanoid behavior in novel reconstructed scenes.

HandDreamer: Zero-Shot Text to 3D Hand Model Generation using Corrective Hand Shape Guidance

cs.CV · 2026-04-06 · unverdicted · novelty 6.0

HandDreamer is the first zero-shot text-to-3D method for hands that uses MANO initialization, skeleton-guided diffusion, and corrective shape guidance to produce view-consistent models.

Multimodal-NF: A Wireless Dataset for Near-Field Low-Altitude Sensing and Communications

eess.SP · 2026-03-30 · unverdicted · novelty 6.0

Introduces Multimodal-NF, a synchronized dataset of near-field CSI with RGB, LiDAR, and GPS data plus an open generator for low-altitude XL-MIMO research.

SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding

cs.RO · 2025-11-21 · unverdicted · novelty 6.0

SPEAR-1 combines a 3D-enriched VLM with embodied control to match or exceed existing robotic foundation models using 20 times fewer robot demonstrations.

Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

cs.CV · 2025-05-29 · unverdicted · novelty 6.0 · 2 refs

Spatial-MLLM adds a 3D spatial encoder initialized from a visual geometry model and space-aware frame sampling to MLLMs to improve spatial understanding and reasoning from purely 2D visual inputs.

citing papers explorer

Showing 46 of 46 citing papers.

Scalable and Differentiable Point-Cloud Registration Using Maximum Mean Discrepancy cs.CV · 2026-06-26 · unverdicted · none · ref 47 · internal anchor
MMD-Reg registers point clouds without correspondences by minimizing an MMD objective approximated via random Fourier features, solved with Levenberg-Marquardt and differentiated via the implicit function theorem for use as a neural network layer.
CARD: A Multi-Modal Automotive Dataset for Dense 3D Reconstruction in Challenging Road Topography cs.CV · 2026-05-06 · conditional · none · ref 63 · internal anchor
CARD is a new multi-modal driving dataset delivering ~500K dense depth pixels per frame from challenging road topographies using stereo cameras and fused LiDARs over 110 km.
Manifold k-NN: Accelerated k-NN Queries for Manifold Point Clouds cs.CG · 2026-05-04 · unverdicted · none · ref 21 · internal anchor
Manifold k-NN generalizes DP-NNS to k-NN queries on manifold point clouds via a recursive successor-list property, delivering 1-10x speedups and full dynamic support.
Paired-CSLiDAR: Height-Stratified Registration for Cross-Source Aerial-Ground LiDAR Pose Refinement cs.RO · 2026-05-01 · conditional · none · ref 41 · internal anchor
Paired-CSLiDAR benchmark and Residual-Guided Stratified Registration achieve 86% success at 0.75 m RMSE on 9,012 cross-source pairs by height-stratified ICP and confidence-gated selection.
PC2Model: ISPRS benchmark on 3D point cloud to model registration cs.CV · 2026-04-21 · unverdicted · none · ref 27 · internal anchor
PC2Model is a new public benchmark dataset combining simulated and real-world 3D point clouds with corresponding models to train and test registration methods.
ClipGStream: Clip-Stream Gaussian Splatting for Any Length and Any Motion Multi-View Dynamic Scene Reconstruction cs.CV · 2026-04-15 · unverdicted · none · ref 59 · internal anchor
ClipGStream enables scalable flicker-free reconstruction of long dynamic multi-view videos by performing stream optimization at the clip level with clip-independent spatio-temporal fields, residual anchor compensation, and inter-clip inherited anchors.
SEM-ROVER: Semantic Voxel-Guided Diffusion for Large-Scale Driving Scene Generation cs.CV · 2026-04-07 · unverdicted · none · ref 35 · internal anchor
SEM-ROVER generates large multiview-consistent 3D urban driving scenes via semantic-conditioned diffusion on Σ-Voxfield voxel grids with progressive outpainting and deferred rendering.
2D Triangle Splatting for Direct Differentiable Mesh Training cs.CV · 2025-06-23 · unverdicted · none · ref 37 · internal anchor
2D Triangle Splatting uses 2D triangles instead of 3D Gaussians to enable differentiable optimization that yields opaque mesh-like reconstructions with competitive visual quality.
ViVo: A Dataset for Volumetric Video Reconstruction and Compression cs.CV · 2025-05-31 · conditional · none · ref 45 · internal anchor
ViVo introduces a diverse multi-view volumetric video dataset with raw multi-camera RGB-depth data, calibration, masks, and point clouds to support reconstruction and compression research, with benchmarks highlighting limitations of current methods.
Curve Skeletonization in Continuous domain for Meshes and Point Clouds cs.GR · 2026-05-25 · unverdicted · none · ref 63 · internal anchor
CSCD generalizes LS to continuous domain with CSCD-M using intrinsic triangulation for meshes and CSCD-PC using tufted Laplacians for point clouds, claiming to match or outperform priors on benchmarks.
LAPS: Improving Incremental LiDAR Mapping using Active Pooling and Sampling for Neural Distance Fields cs.RO · 2026-05-15 · unverdicted · none · ref 24 · internal anchor
LAPS improves incremental neural LiDAR mapping by combining reliability-based active pooling for sample retention with uncertainty-guided active sampling for optimization focus.
Towards Virtual Qualification in Nuclear Fusion: Demonstrating Probabilistic Model Validation on a High Heat Flux Component physics.plasm-ph · 2026-05-12 · conditional · none · ref 39 · internal anchor
A probabilistic validation framework with a novel modified area validation metric quantifies finite element model error for fusion heat sinks while separating it from aleatoric and epistemic experimental uncertainties.
Learning Point Cloud Geometry as a Statistical Manifold: Theory and Practice cs.RO · 2026-05-11 · unverdicted · none · ref 57 · internal anchor
Point cloud geometry is cast as a statistical manifold of per-point Gaussians, with POLI learning the mapping self-supervisedly to improve perception without labeled data.
Globally adaptive and locally regular point discretization of curved surfaces cs.CE · 2026-05-05 · unverdicted · none · ref 48 · internal anchor
A gradient-descent algorithm with level-set surface representation and dynamic point adjustment generates curvature-adaptive, locally regular point distributions on curved surfaces with low deviation from target spacing.
Structural MAT: Clean and Scalable Medial Axis Simplification via Explicit Surface Correspondence cs.GR · 2026-05-04 · unverdicted · none · ref 179 · internal anchor
A new MAT simplification algorithm uses explicit surface correspondence tracking and priority-controlled edge collapses to preserve structural features like fillet alignments on discrete meshes.
PRIME: Protein Representation via Physics-Informed Multiscale Equivariant Hierarchies cs.LG · 2026-05-02 · unverdicted · none · ref 29 · 2 links · internal anchor
PRIME is a five-level hierarchical equivariant graph model for proteins that uses physics-informed deterministic operators to exchange information across scales and achieves state-of-the-art results on fold classification and reaction class prediction.
From Stealthy Data Fabrication to Unsafe Driving: Realistic Scenario Attacks on Collaborative Perception cs.CR · 2026-05-02 · unverdicted · none · ref 63 · internal anchor
A new online attack framework manipulates object poses in shared CAV perception data below detection thresholds, propagating errors to cause unsafe trajectory predictions and behaviors in up to 50% of tested scenarios while evading defenses.
Leveraging Previous-Traversal Point Cloud Map Priors for Camera-Based 3D Object Detection and Tracking cs.CV · 2026-04-28 · unverdicted · none · ref 34 · internal anchor
DualViewMapDet fuses prior-traversal point cloud maps into camera features via dual perspective-view and bird's-eye-view encoding to improve 3D detection and tracking without LiDAR.
TouchAnything: Diffusion-Guided 3D Reconstruction from Sparse Robot Touches cs.CV · 2026-04-10 · unverdicted · none · ref 55 · internal anchor
TouchAnything reconstructs accurate 3D object geometries from only a few tactile contacts by optimizing for consistency with a pretrained visual diffusion prior.
Visually-grounded Humanoid Agents cs.CV · 2026-04-09 · unverdicted · none · ref 120 · internal anchor
A coupled world-agent framework uses 3D Gaussian reconstruction and first-person RGB-D perception with iterative planning to enable goal-directed, collision-avoiding humanoid behavior in novel reconstructed scenes.
HandDreamer: Zero-Shot Text to 3D Hand Model Generation using Corrective Hand Shape Guidance cs.CV · 2026-04-06 · unverdicted · none · ref 64 · internal anchor
HandDreamer is the first zero-shot text-to-3D method for hands that uses MANO initialization, skeleton-guided diffusion, and corrective shape guidance to produce view-consistent models.
Multimodal-NF: A Wireless Dataset for Near-Field Low-Altitude Sensing and Communications eess.SP · 2026-03-30 · unverdicted · none · ref 15 · internal anchor
Introduces Multimodal-NF, a synchronized dataset of near-field CSI with RGB, LiDAR, and GPS data plus an open generator for low-altitude XL-MIMO research.
SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding cs.RO · 2025-11-21 · unverdicted · none · ref 52 · internal anchor
SPEAR-1 combines a 3D-enriched VLM with embodied control to match or exceed existing robotic foundation models using 20 times fewer robot demonstrations.
Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence cs.CV · 2025-05-29 · unverdicted · none · ref 71 · 2 links · internal anchor
Spatial-MLLM adds a 3D spatial encoder initialized from a visual geometry model and space-aware frame sampling to MLLMs to improve spatial understanding and reasoning from purely 2D visual inputs.
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces cs.CV · 2024-12-18 · unverdicted · none · ref 106 · internal anchor
MLLMs achieve competitive but subhuman performance on the new VSI-Bench for visual-spatial intelligence from videos, with spatial reasoning as the main bottleneck and explicit cognitive map generation improving distance estimation.
GEM: Generative Supervision Helps Embodied Intelligence cs.CV · 2026-05-27 · unverdicted · none · ref 103 · internal anchor
GEM adds generative depth supervision to VLM pre-training and reports improved results on embodied benchmarks plus real-world robot execution.
PQDT: Pseudo-Query Dual Transformer for Robust Point Cloud Restoration cs.CV · 2026-05-24 · unverdicted · none · ref 76 · internal anchor
PQDT is a unified Transformer-based network using a Pseudo-Query module to restore high-quality point cloud geometry from diverse degradations, claiming to surpass prior methods on combined completion, denoising, and deformation tasks.
From Full and Partial Intraoral Scans to Crown Proposal: A Classification-Guided Restoration Assistance Pipeline eess.IV · 2026-05-14 · unverdicted · none · ref 30 · internal anchor
A classification-routed pipeline segments partial and full intraoral scans then retrieves and fits crown proposals from neighboring teeth embeddings, reporting macro DSC 0.9249 on 1958 partial scans.
Real-Scale Island Area and Coastline Estimation using Only its Place Name or Coordinates cs.CV · 2026-05-11 · unverdicted · none · ref 13 · internal anchor
A monocular vision system estimates real-scale island area and coastline length with around 10% error using only place name or coordinates input via automated image capture, point cloud generation, and trajectory alignment.
Point Cloud Registration via Probabilistic Self-Update Local Correspondence and Line Vector Sets cs.CV · 2026-04-29 · conditional · none · ref 48 · internal anchor
A new PCR algorithm using probabilistic self-update local correspondence and line vector sets achieves superior time efficiency and at least 10% RMSE improvement over state-of-the-art methods.
MyoVision: A Mobile Research Tool and NEATBoost-Attention Ensemble Framework for Real Time Chicken Breast Myopathy Detection cs.LG · 2026-04-15 · unverdicted · none · ref 59 · internal anchor
Smartphone transillumination imaging paired with a neuroevolution-tuned ensemble model classifies chicken breast myopathies at 82.4% accuracy on 336 fillets, matching costly hyperspectral systems.
MeshOn: Intersection-Free Mesh-to-Mesh Composition cs.GR · 2026-04-09 · unverdicted · none · ref 49 · internal anchor
MeshOn composes two input meshes realistically without intersections by using VLM-based rigid initialization, attractive geometric losses, a barrier loss, and a diffusion prior for final deformation.
R3PM-Net: Real-time, Robust, Real-world Point Matching Network cs.CV · 2026-04-06 · conditional · none · ref 41 · internal anchor
R3PM-Net delivers real-time point cloud registration with high accuracy on synthetic and real-world datasets through a global-aware lightweight architecture and new evaluation benchmarks.
Real-to-Sim for Highly Cluttered Environments via Physics-Consistent Inter-Object Reasoning cs.RO · 2026-02-13 · unverdicted · none · ref 25 · internal anchor
A differentiable optimization pipeline uses a contact graph and rigid-body simulation to jointly refine object poses and physical properties, producing physically valid 3D scene reconstructions from single-view RGB-D observations for cluttered environments.
REACT3D: Recovering Articulations for Interactive Physical 3D Scenes cs.CV · 2025-10-13 · unverdicted · none · ref 34 · internal anchor
A zero-shot framework that recovers part articulations and produces simulation-compatible interactive 3D scene replicas from static inputs.
Geometry-Aware Scene Configurations for Novel View Synthesis cs.CV · 2025-10-10 · unverdicted · none · ref 54 · internal anchor
Geometry-guided adaptive placement of bases and virtual viewpoints improves rendering quality and memory use over uniform arrangements in scalable NeRF for large indoor scenes.
DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation cs.CV · 2025-09-09 · unverdicted · none · ref 83 · internal anchor
LGAA is a modular adapter framework that lifts multi-view diffusion models to produce 2D Gaussian Splats with PBR channels for high-quality relightable 3D mesh extraction using data-efficient finetuning on 69k instances.
Hestia: Voxel-Face-Aware Hierarchical Next-Best-View Acquisition for Efficient 3D Reconstruction cs.RO · 2025-08-01 · conditional · none · ref 86 · internal anchor
Hestia improves generalizable next-best-view planning for 3D reconstruction via hierarchical action search, diverse data, close-greedy strategy, and face-aware voxel design, yielding higher coverage and lower Chamfer distance than prior RL-based methods.
3D Densification for Multi-Map Monocular VSLAM in Endoscopy cs.CV · 2025-03-18 · unverdicted · none · ref 35 · internal anchor
A densification pipeline for multi-map monocular endoscopic VSLAM that aligns NN LightDepth predictions to CudaSIFT sparse submaps via LMedS, reporting 4.15 mm RMS accuracy on the C3VD phantom dataset.
EnforceNet: Monocular Camera Localization in Large Scale Indoor Sparse LiDAR Point Cloud cs.CV · 2019-07-16 · unverdicted · none · ref 36 · internal anchor
EnforceNet achieves centimeter-level monocular camera localization in sparse LiDAR maps of indoor parking garages via a novel resistor module that improves generalization, accuracy, and training speed.
Towards Affordance Prediction with Vision via Task Oriented Grasp Quality Metrics cs.RO · 2019-07-10 · unverdicted · none · ref 24 · internal anchor
The work defines task-oriented grasp metrics from basic ones and trains vision models to infer them in both known-model simulation and partial-information range-image settings.
Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps cs.RO · 2025-01-13 · unverdicted · none · ref 71 · internal anchor
Introduces a sensor-agnostic loop closure pipeline for LiDAR SLAM using density maps, ground alignment, ORB on BEV projections, BST retrieval, and pruning to handle perceptual aliasing.
Contactless 3D Human Body Measurement Using Depth Cameras for Smart Health Monitoring cs.CV · 2026-06-10 · unverdicted · none · ref 1 · internal anchor
Framework for contactless anthropometric measurements from depth camera point clouds using standard libraries to segment body and compute linear dimensions plus volume and area.
A Survey on Deep Learning Architectures for Point Cloud Classification and Segmentation cs.CV · 2026-05-16 · unverdicted · none · ref 137 · 2 links · internal anchor
A survey that categorizes deep learning models for point cloud tasks by backbone architecture, evaluates benchmark performance, and outlines challenges and future research directions.
Bimanual Robot Manipulation via Multi-Agent In-Context Learning cs.RO · 2026-04-22 · unreviewed · ref 53 · internal anchor
GSDeformer: Direct, Real-time and Extensible Cage-based Deformation for 3D Gaussian Splatting cs.CV · 2024-05-24 · unreviewed · ref 39 · internal anchor

Open3D: A Modern Library for 3D Data Processing

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer