SuperPoint: Self-Supervised Interest Point Detection and Description
pith:DOM47ZVA Add to your LaTeX paper
What is a Pith Number?\usepackage{pith}
\pithnumber{DOM47ZVA}
Prints a linked pith:DOM47ZVA badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more
read the original abstract
This paper presents a self-supervised framework for training interest point detectors and descriptors suitable for a large number of multiple-view geometry problems in computer vision. As opposed to patch-based neural networks, our fully-convolutional model operates on full-sized images and jointly computes pixel-level interest point locations and associated descriptors in one forward pass. We introduce Homographic Adaptation, a multi-scale, multi-homography approach for boosting interest point detection repeatability and performing cross-domain adaptation (e.g., synthetic-to-real). Our model, when trained on the MS-COCO generic image dataset using Homographic Adaptation, is able to repeatedly detect a much richer set of interest points than the initial pre-adapted deep model and any other traditional corner detector. The final system gives rise to state-of-the-art homography estimation results on HPatches when compared to LIFT, SIFT and ORB.
This paper has not been read by Pith yet.
Forward citations
Cited by 5 Pith papers
-
Hardware-Aware Neural Feature Extraction for Resource-Constrained Devices
Gideon is a hardware-aware feature extractor using distillation and DNAS that achieves 111 fps on STM32N6 under 1.5 MB memory with negligible INT8 quantization loss.
-
From Seeing to Simulating: Generative High-Fidelity Simulation with Digital Cousins for Generalizable Robot Learning and Evaluation
Digital Cousins is a generative real-to-sim method that creates diverse high-fidelity simulation scenes from real panoramas to improve generalization in robot learning and evaluation.
-
Leveraging AV1 motion vectors for Fast and Dense Feature Matching
AV1 motion vectors filtered by cosine consistency yield dense sub-pixel correspondences that support structure-from-motion on short video clips with lower CPU cost and higher match density than sequential SIFT.
-
Automated Annotation of Shearographic Measurements Enabling Weakly Supervised Defect Detection
An automated annotation pipeline combining Grounded DINO and SAM produces usable bounding boxes and masks for weakly supervised defect detection in shearography.
-
Example-Based Object Detection
EBOD integrates SAM3 with DINOv3 and LightGlue to leverage previous error examples and suppress recurring false positives and negatives without retraining.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.