pith. sign in

hub Mixed citations

Virtual KITTI 2

Mixed citation behavior. Most common role is background (50%).

52 Pith papers citing it
Background 50% of classified citations
abstract

This paper introduces an updated version of the well-known Virtual KITTI dataset which consists of 5 sequence clones from the KITTI tracking benchmark. In addition, the dataset provides different variants of these sequences such as modified weather conditions (e.g. fog, rain) or modified camera configurations (e.g. rotated by 15 degrees). For each sequence, we provide multiple sets of images containing RGB, depth, class segmentation, instance segmentation, flow, and scene flow data. Camera parameters and poses as well as vehicle locations are available as well. In order to showcase some of the dataset's capabilities, we ran multiple relevant experiments using state-of-the-art algorithms from the field of autonomous driving. The dataset is available for download at https://europe.naverlabs.com/Research/Computer-Vision/Proxy-Virtual-Worlds.

hub tools

citation-role summary

dataset 8 background 3 method 1

citation-polarity summary

fields

cs.CV 51 cs.RO 1

clear filters

representative citing papers

Image Generators are Generalist Vision Learners

cs.CV · 2026-04-22 · conditional · novelty 7.0 · 2 refs

An image generator is instruction-tuned to perform diverse vision tasks by representing task outputs as RGB images, achieving SOTA on segmentation and depth estimation.

PointDiT: Pixel-Space Diffusion for Monocular Geometry Estimation

cs.CV · 2026-07-02 · unverdicted · novelty 6.0

PointDiT is a from-scratch pixel-space Diffusion Transformer for monocular 3D point map estimation that outperforms latent diffusion models in sharpness and ambiguous regions while using a simpler architecture.

Argus: Metric Panoramic 3D Reconstruction for Indoor Scenes

cs.CV · 2026-06-29 · unverdicted · novelty 6.0 · 2 refs

Argus introduces a covisibility module and decomposed pixel-to-world mapping to deliver SOTA metric performance on camera pose, depth, and point cloud tasks using the Realsee3D panoramic dataset.

Prompting Diffusion Models for Zero-Shot Instance Segmentation

cs.CV · 2026-06-21 · unverdicted · novelty 6.0

Prompt2Seg augments diffusion models with an explicit spatial prompt conditioning branch, enabling zero-shot instance segmentation that generalizes from limited synthetic category training to diverse unseen objects and visual domains.

citing papers explorer

Showing 50 of 51 citing papers after filters.