arXiv preprint arXiv:2312.01283 (2023)

Chao Fan, Zhenyu Yin, Yue Li, Feiqing Zhang · 2023 · arXiv 2312.01283

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Improved monocular depth prediction using distance transform over pre-semantic contours with self-supervised neural networks

eess.IV · 2026-05-08 · unverdicted · novelty 7.0

Self-supervised monocular depth estimation improves in low-texture regions by using distance transforms on jointly estimated pre-semantic contours to create more informative loss signals.

SS3D: End2End Self-Supervised 3D from Web Videos

cs.CV · 2026-04-24 · unverdicted · novelty 6.0 · 3 refs

SS3D pretrains an end-to-end feed-forward 3D estimator on filtered YouTube-8M videos via SfM self-supervision, MVS filtering, and expert distillation, delivering stronger zero-shot transfer and fine-tuning than prior self-supervised baselines.

Unified 3D Scene Understanding Through Physical World Modeling

cs.CV · 2026-05-23 · unverdicted · novelty 5.0

A probabilistic graphical model called 3WM unifies 3D vision tasks into one system that performs them zero-shot by selecting different inference pathways through multimodal scene nodes.

citing papers explorer

Showing 3 of 3 citing papers.

Improved monocular depth prediction using distance transform over pre-semantic contours with self-supervised neural networks eess.IV · 2026-05-08 · unverdicted · none · ref 16
Self-supervised monocular depth estimation improves in low-texture regions by using distance transforms on jointly estimated pre-semantic contours to create more informative loss signals.
SS3D: End2End Self-Supervised 3D from Web Videos cs.CV · 2026-04-24 · unverdicted · none · ref 11 · 3 links
SS3D pretrains an end-to-end feed-forward 3D estimator on filtered YouTube-8M videos via SfM self-supervision, MVS filtering, and expert distillation, delivering stronger zero-shot transfer and fine-tuning than prior self-supervised baselines.
Unified 3D Scene Understanding Through Physical World Modeling cs.CV · 2026-05-23 · unverdicted · none · ref 4
A probabilistic graphical model called 3WM unifies 3D vision tasks into one system that performs them zero-shot by selecting different inference pathways through multimodal scene nodes.

arXiv preprint arXiv:2312.01283 (2023)

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer