Winner of CVPR2026 NTIRE Challenge on Image Shadow Removal: Semantic and Geometric Guidance for Shadow Removal via Cascaded Refinement
Pith reviewed 2026-05-10 08:40 UTC · model grok-4.3
The pith
A three-stage progressive refinement model guided by DINOv2 semantics and geometric depth/normals cues won the NTIRE 2026 image shadow removal challenge with top scores of 26.68 PSNR and 0.874 SSIM.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
On the official WSRD+ 2026 hidden test set, our final ensemble achieved 26.680 PSNR, 0.8740 SSIM, 0.0578 LPIPS, and 26.135 FID, ranked first overall, and won the NTIRE 2026 Image Shadow Removal Challenge.
Load-bearing premise
That the frozen DINOv2 embeddings and monocular depth/normal estimates supply sufficiently accurate and task-relevant guidance, and that the contraction-constrained objective will reliably stabilize the cascade without capping final performance or causing under-fitting on complex shadows.
Figures
read the original abstract
We present a three-stage progressive shadow-removal pipeline for the CVPR2026 NTIRE WSRD+ challenge. Built on OmniSR, our method treats deshadowing as iterative direct refinement, where later stages correct residual artefacts left by earlier predictions. The model combines RGB appearance with frozen DINOv2 semantic guidance and geometric cues from monocular depth and surface normals, reused across all stages. To stabilise multi-stage optimisation, we introduce a contraction-constrained objective that encourages non-increasing reconstruction error across the cascade. A staged training pipeline transfers from earlier WSRD pretraining to WSRD+ supervision and final WSRD+ 2026 adaptation with cosine-annealed checkpoint ensembling. On the official WSRD+ 2026 hidden test set, our final ensemble achieved 26.680 PSNR, 0.8740 SSIM, 0.0578 LPIPS, and 26.135 FID, ranked first overall, and won the NTIRE 2026 Image Shadow Removal Challenge. The strong performance of the proposed model is further validated on the ISTD+ and UAV-SC+ datasets.
Editorial analysis
A structured set of objections, weighed in public.
Axiom & Free-Parameter Ledger
free parameters (1)
- contraction constraint strength
axioms (2)
- domain assumption Frozen DINOv2 features supply useful semantic context for shadow removal
- domain assumption Monocular depth and normal estimates are sufficiently accurate to guide shadow removal
Reference graph
Works this paper leans on
-
[1]
Seyed Kazem Alavipanah, Mohammad Karimi Firozjaei, Amir Sedighi, Solmaz Fathololoumi, Saeid Zare Naghadehi, Samiraalsadat Saleh, Maryam Naghdizadegan, Zinat Gomeh, Jamal Jokar Arsanjani, Mohsen Makki, et al. The shadow effect on surface biophysical variables derived from remote sensing: a review.Land, 11(11):2025, 2022. 1
work page 2025
-
[2]
Muhammad Umair Arif, Muhammad Umar Farooq, Rana Hammad Raza, Zain Ul Abideen Lodhi, and Muham- mad Abdur Rehman Hashmi. A comprehensive review of vehicle detection techniques under varying moving cast shadow conditions using computer vision and deep learning. IEEE Access, 10:104863–104886, 2022. 1
work page 2022
-
[3]
Jun Chu, Kaichen Chi, and Qi Wang. Rmmamba: Ran- domized mamba for remote sensing shadow removal.IEEE Transactions on Geoscience and Remote Sensing, 2025. 5, 8
work page 2025
-
[4]
Rita Cucchiara, Costantino Grana, Massimo Piccardi, and Andrea Prati. Detecting moving objects, ghosts, and shad- ows in video streams.IEEE Transactions on Pattern Anal- ysis and Machine Intelligence, 25(10):1337–1342, 2003. 1, 2
work page 2003
-
[5]
Auto- exposure fusion for single-image shadow removal
Lan Fu, Changqing Zhou, Qing Guo, Felix Juefei-Xu, Hongkai Yu, Wei Feng, Yang Liu, and Song Wang. Auto- exposure fusion for single-image shadow removal. InPro- ceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10571–10580, 2021. 8
work page 2021
-
[6]
Interactive shadow removal and ground truth for variable scene categories
Han Gong and DP Cosker. Interactive shadow removal and ground truth for variable scene categories. InBMVC 2014- Proceedings of the British Machine Vision Conference 2014,
work page 2014
-
[7]
Shadowformer: Global context helps image shadow removal
Lanqing Guo, Siyu Huang, Ding Liu, Hao Cheng, and Bihan Wen. Shadowformer: Global context helps image shadow removal. InAAAI, 2023. 1, 2, 8
work page 2023
-
[8]
Shadowd- iffusion: When degradation prior meets diffusion model for shadow removal
Lanqing Guo, Chong Wang, Wenhan Yang, Siyu Huang, Yufei Wang, Hanspeter Pfister, and Bihan Wen. Shadowd- iffusion: When degradation prior meets diffusion model for shadow removal. InCVPR, pages 14049–14058, 2023. 1, 2, 8
work page 2023
-
[9]
Mask-shadowgan: Learning to remove shadows from unpaired data
Xiaowei Hu, Yitong Jiang, Chi-Wing Fu, and Pheng-Ann Heng. Mask-shadowgan: Learning to remove shadows from unpaired data. InProceedings of the IEEE/CVF interna- tional conference on computer vision, pages 2472–2481,
-
[10]
Naoto Inoue and Toshihiko Yamasaki. Learning from syn- thetic shadows for shadow detection and removal.IEEE Transactions on Circuits and Systems for Video Technology, 31(11):4187–4197, 2020. 2
work page 2020
-
[11]
Re- thinking fid: Towards a better evaluation metric for image generation
Sadeep Jayasumana, Srikumar Ramalingam, Andreas Veit, Daniel Glasner, Ayan Chakrabarti, and Sanjiv Kumar. Re- thinking fid: Towards a better evaluation metric for image generation. InIEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 9307–9315. IEEE,
-
[12]
Yeying Jin, Aashish Sharma, and Robby T Tan. Dc- shadownet: Single-image hard and soft shadow removal us- ing unsupervised domain-classifier guided network. InPro- ceedings of the IEEE/CVF international conference on com- puter vision, pages 5027–5036, 2021. 8
work page 2021
-
[13]
Des3: Adaptive attention-driven self and soft shadow removal using vit similarity
Yeying Jin, Wei Ye, Wenhan Yang, Yuan Yuan, and Robby T Tan. Des3: Adaptive attention-driven self and soft shadow removal using vit similarity. InProceedings of the AAAI Conference on Artificial Intelligence, pages 2634–2642,
-
[14]
Shadow removal via shadow image decomposition
Hieu Le and Dimitris Samaras. Shadow removal via shadow image decomposition. InICCV, pages 8578–8587, 2019. 2
work page 2019
-
[15]
Regional atten- tion for shadow removal
Hengxing Liu, Mingjia Li, and Xiaojie Guo. Regional atten- tion for shadow removal. InProceedings of the 32nd ACM International Conference on Multimedia, pages 5949–5957,
-
[16]
Jiawei Liu, Qiang Wang, Huijie Fan, Wentao Li, Liangqiong Qu, and Yandong Tang. A decoupled multi-task network for shadow removal.IEEE Transactions on Multimedia, 25: 9449–9463, 2023. 8
work page 2023
-
[17]
Jiawei Liu, Qiang Wang, Huijie Fan, Jiandong Tian, and Yandong Tang. A shadow imaging bilinear model and three- branch residual network for shadow removal.IEEE Trans- actions on Neural Networks and Learning Systems, 35(11): 15857–15871, 2023. 8
work page 2023
-
[18]
Zhihao Liu, Hui Yin, Yang Mi, Mengyang Pu, and Song Wang. Shadow removal by a lightness-guided network with training on unpaired data.IEEE Transactions on Image Pro- cessing, 30:1853–1865, 2021. 8
work page 2021
-
[19]
From shadow generation to shadow removal
Zhihao Liu, Hui Yin, Xinyi Wu, Zhenyao Wu, Yang Mi, and Song Wang. From shadow generation to shadow removal. InProceedings of the IEEE/CVF conference on computer vi- sion and pattern recognition, pages 4927–4936, 2021. 8
work page 2021
-
[20]
Shuang Luo, Huifang Li, Yiqiu Li, Chenglin Shao, Huan- feng Shen, and Liangpei Zhang. An evolutionary shadow correction network and a benchmark uav dataset for remote sensing images.IEEE Transactions on Geoscience and Re- mote Sensing, 61:1–14, 2023. 2
work page 2023
-
[21]
Refusion: Enabling large-size realis- tic image restoration with latent-space diffusion models
Ziwei Luo, Fredrik K Gustafsson, Zheng Zhao, Jens Sj¨olund, and Thomas B Sch ¨on. Refusion: Enabling large-size realis- tic image restoration with latent-space diffusion models. In Proceedings of the IEEE/CVF conference on computer vi- sion and pattern recognition, pages 1680–1691, 2023. 8
work page 2023
-
[22]
Kangfu Mei, Luis Figueroa, Zhe Lin, Zhihong Ding, Scott Cohen, and Vishal M. Patel. Latent feature-guided diffusion models for shadow removal. In2024 IEEE/CVF Winter Con- ference on Applications of Computer Vision (WACV), pages 4301–4310, 2024. 8
work page 2024
-
[23]
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab, Timothee Darcet, Theo Moutakanni, Huy V o, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaa El-Nouby, Mahmoud Assran, et al. Dinov2: Learning robust visual features with- out supervision.arXiv preprint arXiv:2304.07193, 2023. 2, 3
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[24]
Eric T. Reehorst and Philip Schniter. Regularization by de- noising: Clarifications and new interpretations.IEEE Trans- actions on Computational Imaging, 5(1):52–67, 2019. 2
work page 2019
-
[25]
The little engine that could: Regularization by denoising (red)
Yaniv Romano, Michael Elad, and Peyman Milanfar. The little engine that could: Regularization by denoising (red). SIAM Journal on Imaging Sciences, 10(4):1804–1844, 2017
work page 2017
-
[26]
Plug-and-play methods provably converge with properly trained denoisers
Ernest Ryu, Jialin Liu, Sicheng Wang, Xiaohan Chen, Zhangyang Wang, and Wotao Yin. Plug-and-play methods provably converge with properly trained denoisers. InICML, pages 5546–5557, 2019. 2
work page 2019
-
[27]
Guilherme F Silva, Grace B Carneiro, Ricardo Doth, Leonardo A Amaral, and Dario FG de Azevedo. Near real- time shadow detection and removal in aerial motion imagery application.ISPRS Journal of photogrammetry and remote sensing, 140:104–121, 2018. 8
work page 2018
-
[28]
Hong Ye Tan, Subhadip Mukherjee, and Junqi Tang. From image denoisers to regularizing imaging inverse problems: An overview.arXiv preprint arXiv:2509.03475, 2025. 2
-
[29]
Ntire 2023 image shadow removal challenge report
Florin-Alexandru Vasluianu, Tim Seizinger, and Radu Tim- ofte. Ntire 2023 image shadow removal challenge report. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pages 1788–1807, 2023. 3
work page 2023
-
[30]
Wsrd: A novel benchmark for high resolution image shadow removal
Florin-Alexandru Vasluianu, Tim Seizinger, and Radu Tim- ofte. Wsrd: A novel benchmark for high resolution image shadow removal. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition, pages 1826–1835, 2023. 2
work page 2023
-
[31]
Ntire 2024 im- age shadow removal challenge report
Florin-Alexandru Vasluianu, Tim Seizinger, Zhuyun Zhou, Zongwei Wu, Cailian Chen, Radu Timofte, Wei Dong, Han Zhou, Yuqiong Tian, Jun Chen, et al. Ntire 2024 im- age shadow removal challenge report. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 6547–6570, 2024. 2, 4
work page 2024
-
[32]
NTIRE 2026 single im- age shadow removal challenge report
Florin-Alexandru Vasluianu, Tim Seizinger, Zhuyun Zhou, Zongwei WU, Radu Timofte, et al. NTIRE 2026 single im- age shadow removal challenge report. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2026. 5, 6
work page 2026
-
[33]
Singanallur V . Venkatakrishnan, Charles A. Bouman, and Brendt Wohlberg. Plug-and-play priors for model based re- construction. In2013 IEEE Global Conference on Signal and Information Processing, pages 945–948, 2013. 2
work page 2013
-
[34]
Jifeng Wang, Xiang Li, and Jian Yang. Stacked condi- tional generative adversarial networks for jointly learning shadow detection and shadow removal. InProceedings of the IEEE conference on computer vision and pattern recog- nition, pages 1788–1797, 2018. 1, 2, 8
work page 2018
-
[35]
Zhou Wang, Alan C. Bovik, Hamid R. Sheikh, and Eero P. Simoncelli. Image quality assessment: From error visibility tostructural similarity.IEEE Transactions on Image Process- ing (TIP), 13(4):600–612, 2004. 5
work page 2004
-
[36]
Homoformer: Homogenized trans- former for image shadow removal
Jie Xiao, Xueyang Fu, Yurui Zhu, Dong Li, Jie Huang, Kai Zhu, and Zheng-Jun Zha. Homoformer: Homogenized trans- former for image shadow removal. InCVPR, pages 25617– 25626, 2024. 1, 2, 8
work page 2024
-
[37]
Omnisr: Shadow removal un- der direct and indirect lighting
Jiamin Xu, Zelong Li, Yuxin Zheng, Chenyu Huang, Renshu Gu, Weiwei Xu, and Gang Xu. Omnisr: Shadow removal un- der direct and indirect lighting. InAAAI, pages 8887–8895,
-
[38]
Detail-preserving latent diffusion for stable shadow removal
Jiamin Xu, Yuxin Zheng, Zelong Li, Chi Wang, Renshu Gu, Weiwei Xu, and Gang Xu. Detail-preserving latent diffusion for stable shadow removal. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7592–7602, 2025. 5, 8
work page 2025
-
[39]
Lihe Yang, Bingyi Kang, Zilong Huang, Zhen Zhao, Xiao- gang Xu, Jiashi Feng, and Hengshuang Zhao. Depth any- thing v2.arXiv:2406.09414, 2024. 2, 3
work page internal anchor Pith review arXiv 2024
-
[40]
Efros, Eli Shecht- man, and Oliver Wang
Richard Zhang, Phillip Isola, Alexei A. Efros, Eli Shecht- man, and Oliver Wang. The unreasonable effectiveness of deep features as a perceptual metric. InIEEE/CVF Con- ference on Computer Vision and Pattern Recognition, pages 586–595, 2018. 4, 5
work page 2018
-
[41]
Wuming Zhang, Xi Zhao, Jean-Marie Morvan, and Liming Chen. Improving shadow suppression for illumination robust face recognition.IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(3):611–624, 2018. 1, 2
work page 2018
-
[42]
Bijective mapping network for shadow removal
Yurui Zhu, Jie Huang, Xueyang Fu, Feng Zhao, Qibin Sun, and Zheng-Jun Zha. Bijective mapping network for shadow removal. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 5627–5636,
-
[43]
Bidirectional resid- ual transformer for shadow removal
Yurui Zhu, Jie Xiao, Xueyang Fu, et al. Bidirectional resid- ual transformer for shadow removal. InECCV, 2022. 1, 2
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.