Canonical reference

In: Proceedings of the IEEE/CVF ICCV, pp

Liu, Z · 2021

Canonical reference. 100% of citing Pith papers cite this work as background.

6 Pith papers citing it

Background 100% of classified citations

citation-role summary

background: 7 · dataset: 1

citation-polarity summary

background: 8

representative citing papers

Continuous Expert Assembly: Instance-Conditioned Low-Rank Residuals for All-in-One Image Restoration

cs.CV · 2026-05-07 · unverdicted · novelty 7.0

CEA assembles per-token low-rank residual updates via dense affinities over hyper-adapter-generated components to improve all-in-one image restoration on spatially non-uniform degradations.

Driver-WM: A Driver-Centric Traffic-Conditioned Latent World Model for In-Cabin Dynamics Rollout

cs.RO · 2026-05-06 · unverdicted · novelty 7.0

Driver-WM is a driver-centric latent world model for causal rollout of in-cabin dynamics conditioned on out-cabin traffic, unifying kinematics forecasting with behavioral and emotional recognition via dual-stream architecture and gated injection.

Beyond Seeing Is Believing: On Crowdsourced Detection of Audiovisual Deepfakes

cs.IR · 2026-05-06 · unverdicted · novelty 5.0

Crowdsourced judgments reliably flag authentic videos but frequently miss manipulations and struggle to identify whether changes are audio-only, video-only, or both.

InterFuserDVS: Event-Enhanced Sensor Fusion for Safe RL-Based Decision Making

cs.CV · 2026-05-05 · unverdicted · novelty 5.0 · 2 refs

Integrating DVS event data into InterFuser through token fusion yields a driving score of 77.2 and 100% route completion on CARLA benchmarks, indicating improved robustness in dynamic conditions.

From Spherical to Gaussian: A Comparative Analysis of Point Cloud Cropping Strategies in Large-Scale 3D Environments

cs.CV · 2026-05-03 · unverdicted · novelty 5.0 · 3 refs

Gaussian and related cropping strategies for point cloud subclouds improve 3D neural network performance over spherical cropping on large outdoor scenes.

Retina-RAG: Retrieval-Augmented Vision-Language Modeling for Joint Retinal Diagnosis and Clinical Report Generation

cs.CV · 2026-05-07 · unverdicted · novelty 4.0

Retina-RAG combines a retinal classifier, LoRA-tuned Qwen2.5-VL, and RAG to jointly grade DR, detect ME, and generate reports, reaching F1 scores of 0.731 and 0.948 while exceeding baselines on ROUGE-L and SBERT metrics.

citing papers explorer

Showing 6 of 6 citing papers.

Continuous Expert Assembly: Instance-Conditioned Low-Rank Residuals for All-in-One Image Restoration
cs.CV · 2026-05-07 · unverdicted · none · ref 23

Driver-WM: A Driver-Centric Traffic-Conditioned Latent World Model for In-Cabin Dynamics Rollout
cs.RO · 2026-05-06 · unverdicted · none · ref 1

Beyond Seeing Is Believing: On Crowdsourced Detection of Audiovisual Deepfakes
cs.IR · 2026-05-06 · unverdicted · none · ref 52

InterFuserDVS: Event-Enhanced Sensor Fusion for Safe RL-Based Decision Making
cs.CV · 2026-05-05 · unverdicted · none · ref 19 · 2 links

From Spherical to Gaussian: A Comparative Analysis of Point Cloud Cropping Strategies in Large-Scale 3D Environments
cs.CV · 2026-05-03 · unverdicted · none · ref 36 · 3 links

Retina-RAG: Retrieval-Augmented Vision-Language Modeling for Joint Retinal Diagnosis and Clinical Report Generation
cs.CV · 2026-05-07 · unverdicted · none · ref 33
