pith. sign in

arxiv: 2606.29430 · v1 · pith:KK6SAXRQnew · submitted 2026-06-28 · 💻 cs.CV

EvLIR: Learning Illumination Residuals from Ordered Events for Low-Light Image Enhancement

Pith reviewed 2026-06-30 07:48 UTC · model grok-4.3

classification 💻 cs.CV
keywords low-light image enhancementevent cameraillumination residualtemporal modelingRetinexConvGRUevent voxelimage restoration
0
0 comments X

The pith

Preserving the ordered temporal bins of event data allows a ConvGRU module to generate bounded illumination corrections that improve Retinex-based low-light image restoration.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents EvLIR as a framework that takes a low-light image and its aligned event voxel to address missing structure, noise, and weak contrast. It keeps the temporal order of event bins instead of flattening them into a static stack and routes the sequence through a Temporal Event Residual Module that uses a lightweight ConvGRU to extract short-window dynamics. The extracted state becomes a bounded illumination correction that supplies spatially adaptive guidance to a Retinex-style estimator and a reliability-aware restoration stage. A sympathetic reader would care because the method shows how high-temporal-resolution event observations can supply photometric information that conventional frames lack.

Core claim

EvLIR preserves the ordered temporal bins of the event stream and introduces a Temporal Event Residual Module (TERM) to encode short-window event dynamics with a lightweight ConvGRU. The resulting temporal state is converted into a bounded illumination correction, which provides spatially adaptive photometric guidance for Retinex-style illumination estimation and subsequent reliability-aware image-event restoration.

What carries the argument

The Temporal Event Residual Module (TERM), which encodes ordered short-window event dynamics with a ConvGRU to produce a bounded illumination correction for Retinex estimation.

If this is right

  • The method records the best score on eleven of twelve dataset-metric pairs across SDE and SDSD indoor and outdoor sets.
  • Average performance reaches 25.63 dB PSNR, 28.30 dB PSNR*, and 0.827 SSIM over the four benchmarks.
  • The bounded correction supplies spatially adaptive photometric guidance that standard Retinex pipelines lack.
  • Reliability-aware fusion of the image and event-derived correction reduces artifacts from saturated noise and missing structure.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same ordered-event residual idea could be tested on video sequences to enforce temporal consistency across frames.
  • Because the correction is bounded, the module might be inserted as a lightweight plug-in into existing Retinex networks without retraining the entire pipeline.
  • Real-world event streams with higher noise or larger motion could be used to check whether the ConvGRU state remains stable outside the controlled benchmarks.
  • The approach implies that event cameras could serve as an auxiliary sensor for mobile low-light capture rather than requiring new hardware.

Load-bearing premise

That keeping the ordered temporal bins of events and encoding them with a ConvGRU will produce a bounded illumination correction that meaningfully improves Retinex-style estimation.

What would settle it

An ablation that replaces the ordered TERM module with an unordered event stack and shows no gain or a drop in PSNR and SSIM on the SDE and SDSD benchmarks would falsify the claim.

Figures

Figures reproduced from arXiv: 2606.29430 by Ali Anaissi, Chuanzhi Xu, Haodong Chen, Haoxian Zhou, Langyi Chen, Pengfei Ye, Qiang Qu, Weidong Cai.

Figure 1
Figure 1. Figure 1: Temporal-residual enhancement in EvLIR. A low-light frame provides an unreliable snapshot, while ordered event bins expose short-window brightness-change trends. The Temporal Event Residual Module (TERM) summarizes these temporal cues and predicts a bounded illumination correction for low-light restoration. Abstract Low-light image enhancement is severely ill-posed when the input frame contains missing str… view at source ↗
Figure 2
Figure 2. Figure 2: Architecture of EvLIR. Ordered event bins are processed by the Temporal Event Residual Module to predict a bounded illumination correction. The corrected illumination drives Retinex light-up, and reliability-aware image-event restoration produces the final image. use 3 × 3 convolutions. The temporal state Hk after the last ordered bin summarizes the short-term event dynamics inside the window. This sequent… view at source ↗
Figure 3
Figure 3. Figure 3: Qualitative comparison on SDE-in. Input Event eSL-Net SNR-Net Retinexformer EvLight Ours GT [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Qualitative comparison on SDE-out. and the Structural Similarity Index Measure (SSIM) [32] as image restoration metrics. More experimental details, comparison-scope discussion, and additional results are provided in section A of the supplementary material. Implementation Details. All images are evaluated at 260 × 346 resolution. For EvLIR, the event voxel has 32 channels and is split into K = 8 ordered tem… view at source ↗
Figure 5
Figure 5. Figure 5: Qualitative comparison on SDSD-in. Input Event eSL-Net SNR-Net Retinexformer EvLight Ours GT [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Qualitative comparison on SDSD-out. and SSIM, suggesting stronger exposure-normalized fi￾delity and structural similarity. 4.3. Qualitative Comparison Figures 3–6 show qualitative comparisons on randomly se￾lected test samples from each dataset. Compared with image-only and event-guided baselines, EvLIR restores clearer structural boundaries and preserves more fine de￾tails in low-light regions. The result… view at source ↗
Figure 7
Figure 7. Figure 7: Visualization of learned illumination residuals on three [PITH_FULL_IMAGE:figures/full_fig_p008_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Frame-event alignment and temporal bin density for a real low-light capture. The event window is attached to the target frame and [PITH_FULL_IMAGE:figures/full_fig_p012_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Event diagnostics for an SDE-in sample: raw asynchronous events, positive/negative accumulation, and the eight ordered bins [PITH_FULL_IMAGE:figures/full_fig_p013_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: From ordered event bins to illumination-residual diagnostics. The first two rows show the eight event groups in temporal order, [PITH_FULL_IMAGE:figures/full_fig_p014_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Base illumination, temporal residual, corrected illumination, and ConvGRU feature energy for the same sample. [PITH_FULL_IMAGE:figures/full_fig_p014_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Low-light input, Retinex light-up image, and the SNR reliability map used during restoration. [PITH_FULL_IMAGE:figures/full_fig_p015_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Fine-detail comparisons on all four benchmarks. Each dataset contains two regions cropped from the same sample. Relative to [PITH_FULL_IMAGE:figures/full_fig_p015_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: Real low-light captures with event activity and SDE-in outputs, part 1. The columns show the low-light input crop, cropped [PITH_FULL_IMAGE:figures/full_fig_p016_14.png] view at source ↗
Figure 15
Figure 15. Figure 15: Real low-light captures with event activity and SDE-in outputs, part 2. The examples complement Fig. [PITH_FULL_IMAGE:figures/full_fig_p017_15.png] view at source ↗
read the original abstract

Low-light image enhancement is severely ill-posed when the input frame contains missing structure, saturated noise, and weak local contrast. Event cameras provide asynchronous brightness-change observations with high temporal resolution, but prior works often treat voxel channels as an unordered or static feature stack before fusion, rather than explicitly modeling their within-window temporal evolution, weakening the temporal evidence that makes events useful. We propose EvLIR, a temporal-residual enhancement framework that learns illumination residuals from ordered events for low-light image enhancement. Given a low-light frame and its aligned event voxel, EvLIR preserves the ordered temporal bins of the event stream and introduces a Temporal Event Residual Module (TERM) to encode short-window event dynamics with a lightweight ConvGRU. The resulting temporal state is converted into a bounded illumination correction, which provides spatially adaptive photometric guidance for Retinex-style illumination estimation and subsequent reliability-aware image-event restoration. On SDE and SDSD indoor/outdoor benchmarks, EvLIR achieves the best result on eleven of twelve dataset-metric pairs, with average scores of 25.63~dB PSNR, 28.30~dB PSNR*, and 0.827 SSIM across the four benchmarks.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes EvLIR, a temporal-residual framework for low-light image enhancement that takes a low-light frame and aligned event voxel as input. It preserves the ordered temporal bins of the event stream and introduces a Temporal Event Residual Module (TERM) that encodes short-window dynamics via a lightweight ConvGRU; the resulting state is mapped to a bounded illumination correction that supplies spatially adaptive guidance to a Retinex-style estimator followed by reliability-aware restoration. On the SDE and SDSD indoor/outdoor benchmarks the method reports the best score on eleven of twelve dataset-metric pairs, with aggregate figures of 25.63 dB PSNR, 28.30 dB PSNR*, and 0.827 SSIM.

Significance. If the performance numbers are reproducible and the temporal modeling component is shown to be responsible for the gains, the work would supply a concrete mechanism for exploiting the high-temporal-resolution brightness-change signal of event cameras inside an illumination-estimation pipeline. The explicit use of ordered bins and a recurrent state (ConvGRU) distinguishes the approach from prior event-voxel methods that collapse the temporal axis before fusion.

major comments (2)
  1. [Experiments section] Ablation experiments (Experiments section): no table or figure isolates the contribution of ordered temporal bins plus ConvGRU inside TERM. A controlled comparison that (i) shuffles the temporal order, (ii) replaces ConvGRU with static aggregation, or (iii) disables the residual path while freezing the rest of the pipeline is required to attribute the reported 11/12 best scores to the stated mechanism rather than to the restoration network or training protocol.
  2. [§3] §3 (Method), TERM description: the conversion of the ConvGRU hidden state into a “bounded illumination correction” is asserted but the bounding operation, its range, and the precise way it is injected into the Retinex illumination estimator are not formalized (no equation or pseudocode). Without this, it is unclear whether the correction is a true residual or an arbitrary additive field.
minor comments (2)
  1. [Abstract] Abstract and §4: reported averages lack error bars or per-benchmark standard deviations; inclusion would strengthen the claim that the method is consistently superior across the four benchmarks.
  2. [Table 1] Figure captions and Table 1: dataset sizes (number of frames or sequences per indoor/outdoor split) and exact metric definitions (PSNR vs. PSNR*) should be stated explicitly for reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments, which highlight areas where the manuscript can be strengthened through additional experiments and formalization. We address each major comment below.

read point-by-point responses
  1. Referee: [Experiments section] Ablation experiments (Experiments section): no table or figure isolates the contribution of ordered temporal bins plus ConvGRU inside TERM. A controlled comparison that (i) shuffles the temporal order, (ii) replaces ConvGRU with static aggregation, or (iii) disables the residual path while freezing the rest of the pipeline is required to attribute the reported 11/12 best scores to the stated mechanism rather than to the restoration network or training protocol.

    Authors: We agree that the current manuscript does not include ablations that isolate the ordered temporal bins and ConvGRU within TERM. In the revised version we will add a dedicated ablation table in the Experiments section reporting results for (i) temporally shuffled event bins, (ii) replacement of ConvGRU by static aggregation (e.g., mean or max pooling across bins), and (iii) removal of the residual path while keeping the remainder of the pipeline fixed. These controlled comparisons will clarify the contribution of the temporal modeling components to the reported performance. revision: yes

  2. Referee: [§3] §3 (Method), TERM description: the conversion of the ConvGRU hidden state into a “bounded illumination correction” is asserted but the bounding operation, its range, and the precise way it is injected into the Retinex illumination estimator are not formalized (no equation or pseudocode). Without this, it is unclear whether the correction is a true residual or an arbitrary additive field.

    Authors: We acknowledge that the conversion step from the ConvGRU hidden state to the bounded illumination correction is described only at a high level and lacks equations. In the revised manuscript we will insert formal equations in Section 3 that define (a) the bounding function applied to the hidden state (including its explicit range), and (b) the precise additive injection of this correction into the Retinex illumination estimator. This will make explicit that the output functions as a spatially adaptive residual term. revision: yes

Circularity Check

0 steps flagged

No circularity; empirical architecture with no self-referential derivations

full rationale

The paper proposes a neural architecture (EvLIR with TERM module) that processes ordered event voxels via ConvGRU to produce an illumination residual for Retinex-style enhancement. No equations, uniqueness theorems, or derivations appear in the provided text that reduce any claimed output to a fitted input or self-citation by construction. Benchmark results (11/12 best scores on SDE/SDSD) are presented as empirical outcomes on external datasets, not as consequences of a closed definitional loop. The central mechanism is a designed module whose contribution would require ablation for attribution but does not exhibit circularity in its formulation.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The approach rests on the domain assumption that event data can be reliably aligned to image frames and that short-window temporal dynamics captured by ConvGRU provide photometric guidance beyond what static event representations supply.

axioms (2)
  • domain assumption Event voxels are temporally ordered and aligned with the input low-light frame
    The method description presupposes aligned event data whose temporal order is preserved.
  • domain assumption Retinex-style illumination estimation benefits from an external bounded correction derived from event dynamics
    The pipeline routes the temporal state into Retinex estimation without independent justification visible in the abstract.

pith-pipeline@v0.9.1-grok · 5769 in / 1407 out tokens · 35946 ms · 2026-06-30T07:48:51.576997+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

50 extracted references · 4 canonical work pages · 1 internal anchor

  1. [1]

    A dynamic his- togram equalization for image contrast enhancement.IEEE transactions on consumer electronics, 53(2):593–600, 2007

    Mohammad Abdullah-Al-Wadud, Md Hasanul Kabir, M Ali Akber Dewan, and Oksam Chae. A dynamic his- togram equalization for image contrast enhancement.IEEE transactions on consumer electronics, 53(2):593–600, 2007. 2

  2. [2]

    A his- togram modification framework and its application for image contrast enhancement.IEEE Transactions on image process- ing, 18(9):1921–1935, 2009

    Tarik Arici, Salih Dikbas, and Yucel Altunbasak. A his- togram modification framework and its application for image contrast enhancement.IEEE Transactions on image process- ing, 18(9):1921–1935, 2009. 2

  3. [3]

    Drwkv: Focusing on object edges for low-light image en- hancement

    Xuecheng Bai, Yuxiang Wang, Boyu Hu, Qinyuan Jie, Chuanzhi Xu, Kechen Li, Hongru Xiao, and Vera Chung. Drwkv: Focusing on object edges for low-light image en- hancement. InProceedings of the IEEE/CVF Winter Confer- ence on Applications of Computer Vision, pages 1554–1564,

  4. [4]

    Evrwkv: A continuous interactive rwkv frame- work for effective event-guided low-light image enhance- ment.IEEE Transactions on Circuits and Systems for Video Technology, 2026

    Wenjie Cai, Qingguo Meng, Zhenyu Wang, Xingbo Dong, and Zhe Jin. Evrwkv: A continuous interactive rwkv frame- work for effective event-guided low-light image enhance- ment.IEEE Transactions on Circuits and Systems for Video Technology, 2026. 11

  5. [5]

    Retinexformer: One-stage retinex- based transformer for low-light image enhancement

    Yuanhao Cai, Hao Bian, Jing Lin, Haoqian Wang, Radu Tim- ofte, and Yulun Zhang. Retinexformer: One-stage retinex- based transformer for low-light image enhancement. InPro- ceedings of the IEEE/CVF international conference on com- puter vision, pages 12504–12513, 2023. 2, 5

  6. [6]

    Contextual and variational contrast enhancement.IEEE Transactions on Image Process- ing, 20(12):3431–3441, 2011

    Turgay Celik and Tardi Tjahjadi. Contextual and variational contrast enhancement.IEEE Transactions on Image Process- ing, 20(12):3431–3441, 2011. 2

  7. [7]

    Fracevent: Event-camera simulation via fractional-relaxation pixel dynamics, 2026

    Langyi Chen, Chuanzhi Xu, Haoxian Zhou, Pengfei Ye, Ziyu Luo, Haodong Chen, Qiang Qu, Xiaoming Chen, and Weidong Cai. Fracevent: Event-camera simulation via fractional-relaxation pixel dynamics, 2026. 2

  8. [8]

    Event-based vision: A survey.IEEE transactions on pattern analysis and machine intelligence, 44(1):154–180, 2020

    Guillermo Gallego, Tobi Delbr ¨uck, Garrick Orchard, Chiara Bartolozzi, Brian Taba, Andrea Censi, Stefan Leutenegger, Andrew J Davison, J ¨org Conradt, Kostas Daniilidis, et al. Event-based vision: A survey.IEEE transactions on pattern analysis and machine intelligence, 44(1):154–180, 2020. 2

  9. [9]

    Low-latency auto- motive vision with event cameras.Nature, 629(8014):1034– 1040, 2024

    Daniel Gehrig and Davide Scaramuzza. Low-latency auto- motive vision with event cameras.Nature, 629(8014):1034– 1040, 2024. 2

  10. [10]

    Low cost and latency event camera background activity denoising.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(1):785– 795, 2022

    Shasha Guo and Tobi Delbruck. Low cost and latency event camera background activity denoising.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(1):785– 795, 2022. 2

  11. [11]

    Lime: Low-light im- age enhancement via illumination map estimation.IEEE Transactions on image processing, 26(2):982–993, 2016

    Xiaojie Guo, Yu Li, and Haibin Ling. Lime: Low-light im- age enhancement via illumination map estimation.IEEE Transactions on image processing, 26(2):982–993, 2016. 2

  12. [12]

    Eretinex: Event camera meets retinex theory for low-light image enhancement

    Xuejian Guo, Zhiqiang Tian, Yuehang Wang, Siqi Li, Yu Jiang, Shaoyi Du, and Yue Gao. Eretinex: Event camera meets retinex theory for low-light image enhancement. In 2025 IEEE International Conference on Robotics and Au- tomation (ICRA), pages 3759–3765. IEEE, 2025. 3

  13. [13]

    v2e: From video frames to realistic dvs events

    Yuhuang Hu, Shih-Chii Liu, and Tobi Delbruck. v2e: From video frames to realistic dvs events. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 1312–1321, 2021. 11

  14. [14]

    Scope of va- lidity of psnr in image/video quality assessment.Electronics letters, 44(13):800–801, 2008

    Quan Huynh-Thu and Mohammed Ghanbari. Scope of va- lidity of psnr in image/video quality assessment.Electronics letters, 44(13):800–801, 2008. 5

  15. [15]

    Enlightengan: Deep light enhancement without paired supervision.IEEE transactions on image processing, 30:2340–2349, 2021

    Yifan Jiang, Xinyu Gong, Ding Liu, Yu Cheng, Chen Fang, Xiaohui Shen, Jianchao Yang, Pan Zhou, and Zhangyang Wang. Enlightengan: Deep light enhancement without paired supervision.IEEE transactions on image processing, 30:2340–2349, 2021. 2

  16. [16]

    Event-based low-illumination image enhancement.IEEE Transactions on Multimedia, 26:1920– 1931, 2023

    Yu Jiang, Yuehang Wang, Siqi Li, Yongji Zhang, Minghao Zhao, and Yue Gao. Event-based low-illumination image enhancement.IEEE Transactions on Multimedia, 26:1920– 1931, 2023. 2, 3, 5

  17. [17]

    Event-guided low light image enhancement via a dual branch gan.Journal of Visual Communication and Image Represen- tation, 95:103887, 2023

    Haiyan Jin, Qiaobin Wang, Haonan Su, and Zhaolin Xiao. Event-guided low light image enhancement via a dual branch gan.Journal of Visual Communication and Image Represen- tation, 95:103887, 2023. 3

  18. [18]

    Towards robust event-guided low-light image enhancement: a large-scale real-world event-image dataset and novel approach

    Guoqiang Liang, Kanghao Chen, Hangyu Li, Yunfan Lu, and Lin Wang. Towards robust event-guided low-light image enhancement: a large-scale real-world event-image dataset and novel approach. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition, pages 23–33, 2024. 2, 3, 5

  19. [19]

    Towards robust event-guided low-light image enhancement: a large-scale real-world event-image dataset and novel approach

    Guoqiang Liang, Kanghao Chen, Hangyu Li, Yunfan Lu, and Lin Wang. Towards robust event-guided low-light image enhancement: a large-scale real-world event-image dataset and novel approach. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition, pages 23–33, 2024. 11

  20. [20]

    Chunxiao Liu, Zelong Wang, Philip Birch, and Xun Wang. Efficient retinex-based framework for low-light image en- hancement without additional networks.IEEE Transactions on Circuits and Systems for Video Technology, 35(5):4896– 4909, 2024. 2

  21. [21]

    Low-light video enhancement with synthetic event guidance

    Lin Liu, Junfeng An, Jianzhuang Liu, Shanxin Yuan, Xi- angyu Chen, Wengang Zhou, Houqiang Li, Yan Feng Wang, and Qi Tian. Low-light video enhancement with synthetic event guidance. InProceedings of the AAAI Conference on Artificial Intelligence, pages 1692–1700, 2023. 5

  22. [22]

    Ll- net: A deep autoencoder approach to natural low-light image enhancement.Pattern Recognition, 61:650–662, 2017

    Kin Gwn Lore, Adedotun Akintayo, and Soumik Sarkar. Ll- net: A deep autoencoder approach to natural low-light image enhancement.Pattern Recognition, 61:650–662, 2017. 2

  23. [23]

    See: See everything every time–adaptive brightness adjustment for broad light range images via events.arXiv preprint arXiv:2502.21120, 2025

    Yunfan Lu, Xiaogang Xu, Hao Lu, Yanlin Qian, Pengteng Li, Huizai Yao, Bin Yang, Junyi Li, Qianyi Cai, Weiyu Guo, et al. See: See everything every time–adaptive brightness adjustment for broad light range images via events.arXiv preprint arXiv:2502.21120, 2025. 3

  24. [24]

    Bringing a blurry frame alive at high frame-rate with an event camera

    Liyuan Pan, Cedric Scheerlinck, Xin Yu, Richard Hartley, Miaomiao Liu, and Yuchao Dai. Bringing a blurry frame alive at high frame-rate with an event camera. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 6820–6829, 2019. 3

  25. [25]

    Gwretinex-net: Gray world retinex network for low-light image enhancement.IEEE Transac- tions on Circuits and Systems for Video Technology, 2025

    Hu Qiang, Yuzhong Zhong, Yiwei Liao, Xingxing You, Yuqi Zhu, and Songyi Dian. Gwretinex-net: Gray world retinex network for low-light image enhancement.IEEE Transac- tions on Circuits and Systems for Video Technology, 2025. 2

  26. [26]

    Reducing the sim-to-real gap for event cam- eras

    Timo Stoffregen, Cedric Scheerlinck, Davide Scaramuzza, Tom Drummond, Nick Barnes, Lindsay Kleeman, and Robert Mahony. Reducing the sim-to-real gap for event cam- eras. InEuropean Conference on Computer Vision, pages 534–549. Springer, 2020. 3, 5

  27. [27]

    Low-light image enhancement using event-based illumination estimation

    Lei Sun, Yuhan Bao, Jiajun Zhai, Jingyun Liang, Yu- lun Zhang, Kaiwei Wang, Danda Pani Paudel, and Luc Van Gool. Low-light image enhancement using event-based illumination estimation. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 6667– 6677, 2025. 3

  28. [28]

    Event enhanced high-quality image recovery

    Bishan Wang, Jingwei He, Lei Yu, Gui-Song Xia, and Wen Yang. Event enhanced high-quality image recovery. In European Conference on Computer Vision, pages 155–171. Springer, 2020. 5

  29. [29]

    Seeing dynamic scene in the dark: A high-quality video dataset with mechatronic alignment

    Ruixing Wang, Xiaogang Xu, Chi-Wing Fu, Jiangbo Lu, Bei Yu, and Jiaya Jia. Seeing dynamic scene in the dark: A high-quality video dataset with mechatronic alignment. In Proceedings of the IEEE/CVF international conference on computer vision, pages 9700–9709, 2021. 5

  30. [30]

    Seeing dynamic scene in the dark: A high-quality video dataset with mechatronic alignment

    Ruixing Wang, Xiaogang Xu, Chi-Wing Fu, Jiangbo Lu, Bei Yu, and Jiaya Jia. Seeing dynamic scene in the dark: A high-quality video dataset with mechatronic alignment. In Proceedings of the IEEE/CVF international conference on computer vision, pages 9700–9709, 2021. 11

  31. [31]

    Ultra-high-definition low-light image enhancement: A benchmark and transformer-based method

    Tao Wang, Kaihao Zhang, Tianrun Shen, Wenhan Luo, Bjorn Stenger, and Tong Lu. Ultra-high-definition low-light image enhancement: A benchmark and transformer-based method. InProceedings of the AAAI conference on artificial intelli- gence, pages 2654–2662, 2023. 2

  32. [32]

    Image quality assessment: from error visibility to structural similarity.IEEE transactions on image processing, 13(4):600–612, 2004

    Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Si- moncelli. Image quality assessment: from error visibility to structural similarity.IEEE transactions on image processing, 13(4):600–612, 2004. 6

  33. [33]

    Uformer: A general u-shaped transformer for image restoration

    Zhendong Wang, Xiaodong Cun, Jianmin Bao, Wengang Zhou, Jianzhuang Liu, and Houqiang Li. Uformer: A general u-shaped transformer for image restoration. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 17683–17693, 2022. 5

  34. [34]

    Deep Retinex Decomposition for Low-Light Enhancement

    Chen Wei, Wenjing Wang, Wenhan Yang, and Jiaying Liu. Deep retinex decomposition for low-light enhancement. arXiv preprint arXiv:1808.04560, 2018. 2

  35. [35]

    Learning semantic-aware knowledge guidance for low-light image enhancement

    Yuhui Wu, Chen Pan, Guoqing Wang, Yang Yang, Jiwei Wei, Chongyi Li, and Heng Tao Shen. Learning semantic-aware knowledge guidance for low-light image enhancement. In Proceedings of the IEEE/CVF conference on computer vi- sion and pattern recognition, pages 1662–1671, 2023. 5

  36. [36]

    Event- gan: An unsupervised low-light grayscale image enhance- ment method based on event camera.IEEE Sensors Journal,

    Zehao Wu, Yuanqing Xia, Rui Hu, and Runze Gao. Event- gan: An unsupervised low-light grayscale image enhance- ment method based on event camera.IEEE Sensors Journal,

  37. [37]

    Towards end-to-end neuromorphic voxel- based 3d object reconstruction without physical priors

    Chuanzhi Xu, Langyi Chen, Haodong Chen, Vera Chung, and Vincent Qu. Towards end-to-end neuromorphic voxel- based 3d object reconstruction without physical priors. In 2025 IEEE International Conference on Multimedia and Expo (ICME), pages 1–6, 2025. 2

  38. [38]

    A survey of 3d reconstruction with event cameras.Computational Visual Media, (99):1–37,

    Chuanzhi Xu, Haoxian Zhou, Langyi Chen, Haodong Chen, Zeke Zexi Hu, Zhicheng Lu, Ying Zhou, Vera Chung, Qiang Qu, and Weidong Cai. A survey of 3d reconstruction with event cameras.Computational Visual Media, (99):1–37,

  39. [39]

    Ultralight polarity-split neuromor- phic snn for event-stream super-resolution

    Chuanzhi Xu, Haoxian Zhou, Langyi Chen, Yuk Ying Chung, and Qiang Qu. Ultralight polarity-split neuromor- phic snn for event-stream super-resolution. InProceedings of the AAAI Conference on Artificial Intelligence, pages 11196– 11204, 2026. 2

  40. [40]

    Star: A struc- ture and texture aware retinex model.IEEE Transactions on Image Processing, 29:5022–5037, 2020

    Jun Xu, Yingkun Hou, Dongwei Ren, Li Liu, Fan Zhu, Mengyang Yu, Haoqian Wang, and Ling Shao. Star: A struc- ture and texture aware retinex model.IEEE Transactions on Image Processing, 29:5022–5037, 2020. 2

  41. [41]

    Event-illumination collab- orative low-light image enhancement with a high-resolution real-world dataset

    Senyan Xu, Zhijing Sun, Kean Liu, Xin Lu, Ruixuan Jiang, Xueyang Fu, and Zheng-Jun Zha. Event-illumination collab- orative low-light image enhancement with a high-resolution real-world dataset. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition, pages 22270–22280, 2026. 11

  42. [42]

    Snr-aware low-light image enhancement

    Xiaogang Xu, Ruixing Wang, Chi-Wing Fu, and Jiaya Jia. Snr-aware low-light image enhancement. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 17714–17724, 2022. 5

  43. [43]

    Bievlight: Bi-level learning of task-aware event refinement for low-light image enhancement

    Zishu Yao, Xiang-Xiang Su, Shengning Zhou, Guang-Yong Chen, Guodong Fan, and Xing Chen. Bievlight: Bi-level learning of task-aware event refinement for low-light image enhancement. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 41541– 41550, 2026. 11

  44. [44]

    Diff-retinex: Rethinking low-light image enhance- ment with a generative diffusion model

    Xunpeng Yi, Han Xu, Hao Zhang, Linfeng Tang, and Ji- ayi Ma. Diff-retinex: Rethinking low-light image enhance- ment with a generative diffusion model. InProceedings of the IEEE/CVF international conference on computer vision, pages 12302–12311, 2023. 2

  45. [45]

    Learning to see in the dark with events

    Song Zhang, Yu Zhang, Zhe Jiang, Dongqing Zou, Jimmy Ren, and Bin Zhou. Learning to see in the dark with events. InEuropean Conference on Computer Vision, pages 666–

  46. [46]

    Springer, 2020. 2, 3

  47. [47]

    Kindling the darkness: A practical low-light image enhancer

    Yonghua Zhang, Jiawan Zhang, and Xiaojie Guo. Kindling the darkness: A practical low-light image enhancer. InPro- ceedings of the 27th ACM international conference on mul- timedia, pages 1632–1640, 2019. 2

  48. [48]

    Retinexdip: A unified deep framework for low-light image enhancement.IEEE Transactions on Cir- cuits and Systems for Video Technology, 32(3):1076–1088,

    Zunjin Zhao, Bangshu Xiong, Lei Wang, Qiaofeng Ou, Lei Yu, and Fa Kuang. Retinexdip: A unified deep framework for low-light image enhancement.IEEE Transactions on Cir- cuits and Systems for Video Technology, 32(3):1076–1088,

  49. [49]

    Exploiting spatiotemporal properties for efficient event-driven human pose estimation.arXiv preprint arXiv:2512.06306, 2025

    Haoxian Zhou, Chuanzhi Xu, Langyi Chen, Pengfei Ye, Haodong Chen, Yuk Ying Chung, and Qiang Qu. Exploiting spatiotemporal properties for efficient event-driven human pose estimation.arXiv preprint arXiv:2512.06306, 2025. 2

  50. [50]

    Event-based stereo visual odometry.IEEE Transactions on Robotics, 37 (5):1433–1450, 2021

    Yi Zhou, Guillermo Gallego, and Shaojie Shen. Event-based stereo visual odometry.IEEE Transactions on Robotics, 37 (5):1433–1450, 2021. 2 Technical Appendices and Supplementary Material A. Additional Experimental Details Dataset protocol.The experiments use the indoor and outdoor subsets of SDE [19] and SDSD [30]. SDE contains 91 spatially and temporally ...