Towards Multimodal Depth Estimation from Light Fields

Carsten Rother; Lynton Ardizzone; Radek Mackowiak; Titus Leistner; Ullrich Köthe

arxiv: 2203.16542 · v2 · pith:QYYV5ZSCnew · submitted 2022-03-30 · 💻 cs.CV

Titus Leistner, Radek Mackowiak, Lynton Ardizzone, Ullrich Köthe, Carsten Rother

classification 💻 cs.CV

keywords depth · field · light · estimation · methods · multimodal · objects · only

Light field applications, especially light field rendering and depth estimation, have developed rapidly in recent years. While state-of-the-art light field rendering methods handle semi-transparent and reflective objects well, depth estimation methods either ignore these cases altogether or perform poorly on them. We argue that this is because current methods consider only a single "true" depth, even when multiple objects at different depths contribute to the color of a single pixel. Based on the simple idea of outputting a posterior depth distribution instead of a single estimate, we develop and explore several deep-learning-based approaches to the problem. Additionally, we contribute the first "multimodal light field depth dataset", which contains the depths of all objects that contribute to the color of a pixel. This allows us to supervise the multimodal depth prediction and to validate all methods by measuring the KL divergence of the predicted posteriors. With our thorough analysis and novel dataset, we aim to start a new line of depth estimation research that overcomes some of the long-standing limitations of this field.
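
To make the evaluation idea in the abstract concrete, the following is a minimal sketch of comparing per-pixel depth posteriors with a KL divergence. It is not the authors' implementation: the discretization into bins, the Gaussian mixture shape, the value of `sigma`, the bin range, the direction KL(target || prediction), and the helper names `mixture_histogram` and `kl_divergence` are all assumptions made for illustration.

```python
import math


def mixture_histogram(modes, bin_centers, sigma=0.05):
    """Discretize a Gaussian mixture over depth bins (hypothetical helper).

    modes is a list of (depth, weight) pairs, one per contributing surface.
    """
    masses = [
        sum(w * math.exp(-0.5 * ((c - d) / sigma) ** 2) for d, w in modes)
        for c in bin_centers
    ]
    total = sum(masses)
    return [m / total for m in masses]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions over the same bins.

    eps keeps the logarithm finite where q is numerically zero.
    """
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


# Assumed depth (disparity) range, chosen only for this example.
bins = [i / 100 for i in range(-200, 201)]

# A pixel whose color mixes two surfaces, e.g. glass in front of a wall.
target = mixture_histogram([(-1.0, 0.5), (1.0, 0.5)], bins)

# A single-estimate prediction that averages both depths.
unimodal = mixture_histogram([(0.0, 1.0)], bins)

# A two-mode prediction with slightly wrong positions and weights.
bimodal = mixture_histogram([(-0.95, 0.6), (1.05, 0.4)], bins)

kl_unimodal = kl_divergence(target, unimodal)
kl_bimodal = kl_divergence(target, bimodal)
print(f"KL(target || unimodal) = {kl_unimodal:.3f}")
print(f"KL(target || bimodal)  = {kl_bimodal:.3f}")
```

In this toy setup the averaged single-mode prediction places almost no probability mass at either true depth and is penalized heavily, while the imperfect two-mode prediction scores far better. This is the kind of distinction a single-depth error metric cannot express.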

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

DSER: Spectral Epipolar Representation for Efficient Light Field Depth Estimation
cs.CV 2025-08 unverdicted novelty 6.0

DSER combines spectral epipolar regularization with a hybrid pipeline of gradient initialization, plane-sweeping, multiscale refinement, and occlusion-aware random walk to produce structurally consistent depth maps fr...