Towards Multimodal Depth Estimation from Light Fields
Light field applications, especially light field rendering and depth estimation, have developed rapidly in recent years. While state-of-the-art light field rendering methods handle semi-transparent and reflective objects well, depth estimation methods either ignore these cases altogether or deliver only weak performance. We argue that this is due to current methods considering only a single "true" depth, even when multiple objects at different depths contribute to the color of a single pixel. Based on the simple idea of outputting a posterior depth distribution instead of only a single estimate, we develop and explore several different deep-learning-based approaches to the problem. Additionally, we contribute the first "multimodal light field depth dataset" that contains the depths of all objects which contribute to the color of a pixel. This allows us to supervise the multimodal depth prediction and also validate all methods by measuring the KL divergence of the predicted posteriors. With our thorough analysis and novel dataset, we aim to start a new line of depth estimation research that overcomes some of the long-standing limitations of this field.
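To make the evaluation idea concrete, the following is a minimal sketch of how a KL divergence between a multimodal ground-truth depth distribution and a predicted posterior could be computed for one pixel. It is not the authors' implementation: the discretization into 8 depth bins, the example weights, the direction KL(target || prediction), and the helper names `normalize` and `kl_divergence` are all assumptions made for illustration.

```python
import math


def normalize(weights):
    """Scale non-negative weights so that they sum to 1."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must have a positive sum")
    return [w / total for w in weights]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) in nats for two discrete distributions over the same depth bins.

    Bins where p is zero contribute nothing; q is clamped to eps to avoid log(0).
    """
    if len(p) != len(q):
        raise ValueError("p and q must cover the same depth bins")
    return sum(pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0)


# Hypothetical pixel: a semi-transparent surface at bin 2 contributes 40% of the
# color and an opaque surface behind it at bin 6 contributes 60%.
target = normalize([0.0, 0.0, 0.4, 0.0, 0.0, 0.0, 0.6, 0.0])

# A prediction that commits to a single depth (assumed values).
unimodal = normalize([0.01, 0.01, 0.01, 0.01, 0.01, 0.04, 0.90, 0.01])

# A prediction that places mass on both surfaces (assumed values).
bimodal = normalize([0.01, 0.02, 0.37, 0.02, 0.01, 0.02, 0.54, 0.01])

kl_unimodal = kl_divergence(target, unimodal)
kl_bimodal = kl_divergence(target, bimodal)

print(f"KL(target || unimodal) = {kl_unimodal:.3f}")
print(f"KL(target || bimodal)  = {kl_bimodal:.3f}")
```

Under these assumed numbers, the prediction that covers both depths scores a lower divergence than the one that commits to a single depth, which is the behavior a multimodal metric is meant to reward.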
Forward citations
Cited by 1 Pith paper
- DSER: Spectral Epipolar Representation for Efficient Light Field Depth Estimation
DSER combines spectral epipolar regularization with a hybrid pipeline of gradient initialization, plane-sweeping, multiscale refinement, and occlusion-aware random walk to produce structurally consistent depth maps fr...