A Glass-Box Deep-Learning Method for Electrical Energy System Modeling Based on Kolmogorov-Arnold Network
Pith reviewed 2026-05-23 20:51 UTC · model grok-4.3
The pith
Kolmogorov-Arnold Networks produce explicit mathematical formulas for electrical energy system models while preserving nonlinear fitting power.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
KAN can express the physical process with concise and explicit mathematical formulas while retaining the nonlinear-fitting capability of deep neural networks, achieved through its architecture of learnable activation functions on network edges together with sparse training and a subsequent symbolification step that converts the model into symbolic form.
What carries the argument
Kolmogorov-Arnold Network using learnable univariate functions on edges, sparse regularization during training, and symbolification to extract explicit formulas from the trained structure.
If this is right
- Energy-system models become directly inspectable, allowing engineers to read off relationships such as voltage-current dependencies without post-hoc explanation tools.
- The same accuracy, robustness, and generalization performance reported for the three test systems can be expected when the method is applied to other electrical energy modeling tasks.
- Symbolification yields compact expressions that can be inserted into existing simulation or control software without retraining a neural network.
- Interpretability gains do not require sacrificing the ability to capture strong nonlinearities that defeat purely linear or low-order symbolic regression methods.
Where Pith is reading between the lines
- The approach could be tested on other physical domains that already possess known governing equations, such as mechanical or thermal systems, to measure how often the recovered symbols align with established laws.
- If the extracted formulas prove reliable, they could serve as starting points for hybrid physics-informed models that combine data-driven terms with first-principles constraints.
- Widespread adoption might reduce reliance on separate explainability techniques because the model itself is already in symbolic form.
Load-bearing premise
The symbolification step after sparse training recovers the true underlying physical relationships without introducing significant distortion or spurious terms.
What would settle it
Apply KAN to a simple, analytically known electrical system such as an RLC circuit, derive the symbolic formula, and check whether that formula matches the known differential equations to within measurement noise or produces large prediction errors on held-out data.
Figures
read the original abstract
Deep learning methods have been widely used as an end-to-end modeling strategy of electrical energy systems because of their conveniency and powerful pattern recognition capability. However, due to the "closed-box" nature, deep learning methods have long been blamed for their poor interpretability when modeling a physical system. In this paper, we introduce a novel neural network structure, Kolmogorov-Arnold Network (KAN), to achieve "glass-box" modeling for electrical energy systems to enhance the interpretability. The most distinct feature of KAN lies in the learnable activation function together with the sparse training and symbolification process. Consequently, KAN can express the physical process with concise and explicit mathematical formulas while remaining the nonlinear-fitting capability of deep neural networks. Simulation results based on three electrical energy systems demonstrate the effectiveness of KAN in the aspects of interpretability, accuracy, robustness and generalization ability.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes Kolmogorov-Arnold Networks (KAN) as a glass-box deep-learning approach for modeling electrical energy systems. It highlights the use of learnable univariate activation functions, sparse training, and a subsequent symbolification step to derive concise explicit mathematical formulas that represent the underlying physical processes, while retaining the nonlinear approximation power of neural networks. Effectiveness is asserted via simulations on three electrical energy systems in terms of interpretability, accuracy, robustness, and generalization.
Significance. If the central claims hold, the work would offer a practically useful bridge between black-box neural modeling and explicit symbolic representations in power-system applications, potentially enabling domain-expert inspection of learned dynamics without sacrificing fitting flexibility.
major comments (2)
- [Abstract] Abstract: the central claim that 'KAN can express the physical process with concise and explicit mathematical formulas' while accurately representing the systems rests on the symbolification step after sparse training, yet the abstract supplies no quantitative metrics, error bars, baseline comparisons, or validation against known physical equations for the three systems.
- [Method] Method description of symbolification (likely §3): the assumption that thresholding and basis-function fitting on the learned univariate functions recovers the true underlying relationships without systematic mismatch or artifacts is load-bearing for the glass-box interpretability claim, but the manuscript provides no explicit check (e.g., substitution of the derived symbolic expressions back into the system dynamics or residual analysis against the original nonlinear terms).
minor comments (1)
- [Method] Notation for the KAN layers and the symbolification threshold should be defined once and used consistently; currently the transition from numerical activations to symbolic expressions is described only qualitatively.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments, which help improve the clarity and rigor of our manuscript. We address each major comment point by point below.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central claim that 'KAN can express the physical process with concise and explicit mathematical formulas' while accurately representing the systems rests on the symbolification step after sparse training, yet the abstract supplies no quantitative metrics, error bars, baseline comparisons, or validation against known physical equations for the three systems.
Authors: We agree that the abstract would be strengthened by including quantitative support for the claims. In the revised manuscript, we have updated the abstract to report key accuracy metrics (MSE with standard deviations from repeated trials), baseline comparisons against standard neural networks, and explicit references to validation of the derived symbolic expressions against the known physical equations of the three systems. revision: yes
-
Referee: [Method] Method description of symbolification (likely §3): the assumption that thresholding and basis-function fitting on the learned univariate functions recovers the true underlying relationships without systematic mismatch or artifacts is load-bearing for the glass-box interpretability claim, but the manuscript provides no explicit check (e.g., substitution of the derived symbolic expressions back into the system dynamics or residual analysis against the original nonlinear terms).
Authors: The referee is correct that an explicit post-symbolification verification step would strengthen the interpretability claims. While the original simulations already demonstrate low fitting errors and physical consistency, we have added to the revised manuscript a dedicated verification subsection. This includes substituting the symbolic expressions back into the system dynamics and performing residual analysis against the original nonlinear terms for all three case studies, confirming negligible systematic mismatch. revision: yes
Circularity Check
No circularity: KAN application imports external architecture and demonstrates via simulation
full rationale
The paper applies the Kolmogorov-Arnold Network (introduced in independent prior work) to electrical energy systems. Its central claims rest on the properties of sparse training and symbolification as defined in that external reference, followed by empirical validation on three systems. No load-bearing step reduces by the paper's own equations or self-citation to a fitted input or self-defined quantity; the derivation chain is self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Grid-Connected Energy Storage Systems: State-of- the-Art and Emerging Technologies,
G. G. Farivar et al., “Grid-Connected Energy Storage Systems: State-of- the-Art and Emerging Technologies,” Proc. IEEE, vol. 111, no. 4, pp. 397–420, Apr. 2023, doi: 10.1109/JPROC.2022.3183289
-
[2]
A Review of Power Electronics for Grid Connection of Utility-Scale Battery Energy Storage Systems,
G. Wang et al., “A Review of Power Electronics for Grid Connection of Utility-Scale Battery Energy Storage Systems,” IEEE Trans. Sustain. Energy, vol. 7, no. 4, pp. 1778– 1790, Oct. 2016, doi: 10.1109/TSTE.2016.2586941
-
[3]
Time- Domain Parameter Extraction Method for Thévenin -Equivalent Circuit Battery Models,
A. Hentunen, T. Lehmuspelto, and J. Suomela, “Time- Domain Parameter Extraction Method for Thévenin -Equivalent Circuit Battery Models,” IEEE Trans. Energy Convers., vol. 29, no. 3, pp. 558–566, Sep. 2014, doi: 10.1109/TEC.2014.2318205
-
[4]
X. Liu, W. Li, and A. Zhou, “PNGV Equivalent Circuit Model and SOC Estimation Algorithm for Lithium Battery Pack Adopted in AGV Vehicle,” IEEE Access , vol. 6, pp. 23639 –23647, 2018, doi: 10.1109/ACCESS.2018.2812421
-
[5]
A Data- Driven Modeling Method of Virtual Synchronous Generator Based on LSTM Neural Network,
J. Tian, G. Zeng, J. Zhao, X. Zhu, and Z. Zhang, “A Data- Driven Modeling Method of Virtual Synchronous Generator Based on LSTM Neural Network,” IEEE Trans. Ind. Inf. , vol. 20, no. 4, pp. 5428– 5439, Apr. 2024, doi: 10.1109/TII.2023.3333673
-
[6]
Neural Network Design for Impedance Modeling of Power Electronic Systems Based on Latent Features,
Y. Liao et al., “Neural Network Design for Impedance Modeling of Power Electronic Systems Based on Latent Features,” IEEE Trans. Neural Netw. Learning Syst., vol. 35, no. 5, pp. 5968 –5980, May 2024, doi: 10.1109/TNNLS.2023.3235806
-
[7]
A Novel Equivalent Model of Active Distribution Networks Based on LSTM,
C. Zheng et al. , “A Novel Equivalent Model of Active Distribution Networks Based on LSTM,” IEEE Trans. Neural Netw. Learning Syst. , vol. 30, no. 9, pp. 2611 –2624, Sep. 2019, doi: 10.1109/TNNLS.2018. 2885219
-
[8]
A. Vaswani et al., “Attention is all you need,” in Proc. ACM NIPS, vol. 30, 2017, pp. 6000–6010
work page 2017
-
[9]
X. Su, C. Deng, Y. Shan, F. Shahnia, Y. Fu, and Z. Dong, “Fault Diagnosis Based on Interpretable Convolutional Temporal -spatial Attention Network for Offshore Wind Turbines,” JOURNAL OF MODERN POWER SYSTEMS AND CLEAN ENERGY
-
[10]
DAMGAT -Based Interpretable Detection of False Data Injection Attacks in Smart Grids,
X. Su et al., “DAMGAT -Based Interpretable Detection of False Data Injection Attacks in Smart Grids,” IEEE Trans. Smart Grid , vol. 15, no. 4, pp. 4182–4195, Jul. 2024, doi: 10.1109/TSG.2024.3364665
-
[11]
Applications of Physics -Informed Neural Networks in Power Systems - A Review,
B. Huang and J. Wang, “Applications of Physics -Informed Neural Networks in Power Systems - A Review,” IEEE Trans. Power Syst., vol. 38, no. 1, pp. 572–588, Jan. 2023, doi: 10.1109/TPWRS.2022.3162473
-
[12]
Inherently Interpretable Physics -Informed Neural Network for Battery Modeling and Prognosis,
F. Wang et al. , “Inherently Interpretable Physics -Informed Neural Network for Battery Modeling and Prognosis,” IEEE Trans. Neural Netw. Learning Syst. , pp. 1 –15, 2023, doi: 10.1109/TNNLS.2023.3329368
-
[13]
Parameter Estimation of Power Electronic Converters with Physics-Informed Machine Learning,
[S. Zhao, Y. Peng, Y. Zhang, and H. Wang, “Parameter Estimation of Power Electronic Converters with Physics-Informed Machine Learning,” IEEE Trans. Power Electron. , vol. 37, no. 10, pp. 11567– 11578, Oct. 2022, doi: 10.1109/TPEL.2022.3176468
-
[14]
D. Gaspar, P. Silva, and C. Silva, “Explainable AI for Intrusion Detection Systems: LIME and SHAP Applicability on Multi -Layer Perceptron,” IEEE Access , vol. 12, pp. 30164 –30175, 2024, doi: 10.1109/ACCESS.2024.3368377
-
[15]
Explaining the Explainer: A First Theoretical Analysis of LIME
D. Garreau, “Explaining the Explainer: A First Theoretical Analysis of LIME”
-
[16]
KAN: Kolmogorov-Arnold Networks
Z. Liu et al. , “KAN: Kolmogorov -Arnold Networks,” Jun. 16, 2024, arXiv:2404.19756. Available: http://arxiv.org/abs/2404.19756
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[17]
A. N. Kolmogorov, “On the representation of continuous functions of many variables by superposition of continuous functions of one variable and addition,” in American Mathematical Society Translations: Series 2, vol. 28, Providence, Rhode Island: American Mathematical Society, 1963, pp. 55–59. doi: 10.1090/trans2/028/04
-
[18]
Efficiency -Optimized High-Current Dual Active Bridge Converter for Automotive Applications,
F. Krismer and J. W. Kolar, “Efficiency -Optimized High-Current Dual Active Bridge Converter for Automotive Applications,” IEEE Trans. Ind. Electron., vol. 59, no. 7, pp. 2745 –2760, Jul. 2012, doi: 10.1109/TIE.2011.2112312
-
[19]
Deadbeat Control with Bivariate Online Parameter Identification for SPS -Modulated DAB Converters,
T.-Q. Duong and S. -J. Choi, “Deadbeat Control with Bivariate Online Parameter Identification for SPS -Modulated DAB Converters,” IEEE Access, vol. 10, pp. 54079 –54090, 2022, doi: 10.1109/ACCESS.2022.3176428
-
[20]
X. Chen, G. Xu, H. Han, D. Liu, Y. Sun, and M. Su, “Light -Load Efficiency Enhancement of High -Frequency Dual -Active-Bridge Converter Under SPS Control,” IEEE Trans. Ind. Electron. , vol. 68, no. 12, pp. 12941–12946, Dec. 2021, doi: 10.1109/TIE.2020.3044803
-
[21]
R. Dubey et al. , “Measurement of temperature coefficient of photovoltaic modules in field and comparison with laboratory measurements,” in 2015 IEEE 42nd Photovoltaic Specialist Conference (PVSC), New Orleans, LA: IEEE, Jun. 2015, pp. 1 –5. doi: 10.1109/PVSC.2015.7355852
-
[22]
Long Short -Term Memory Networks for Accurate State -of-Charge Estimation of Li-ion Batteries,
E. Chemali, P. J. Kollmeyer, M. Preindl, R. Ahmed, and A. Emadi, “Long Short -Term Memory Networks for Accurate State -of-Charge Estimation of Li-ion Batteries,” IEEE Trans. Ind. Electron. , vol. 65, no. 8, pp. 6730–6739, Aug. 2018, doi: 10.1109/TIE.2017.2787586
-
[23]
State- of-charge estimation of lithium-ion batteries based on gated recurrent neural network,
F. Yang, W. Li, C. Li, and Q. Miao, “State- of-charge estimation of lithium-ion batteries based on gated recurrent neural network,” Energy, vol. 175, pp. 66–75, May 2019, doi: 10.1016/j.energy.2019.03.059
-
[24]
C. Bian, H. He, and S. Yang, “Stacked bidirectional long short -term memory networks for state-of-charge estimation of lithium-ion batteries,” Energy, vol. 191, p. 116538, Jan. 2020, doi: 10.1016/j.energy.2019.116538
-
[25]
Panasonic 18650PF Li -ion Battery Data
K. Phillip, “Panasonic 18650PF Li -ion Battery Data.” doi: 10.17632/wykht8y7tg.1
-
[26]
“Yulara Solar System.” DKA solar centre. Available: http://dkasolarcentre.com.au/download/notes-on-the-data/
-
[27]
Contributions to the mathematical theory of evolution,
K. Pearson, “Contributions to the mathematical theory of evolution,” Philosophical Transactions of the Royal Society of London. A , vol. 185, pp. 71–110, 1894
-
[28]
Correlation calculated from faulty data,
C. Spearman, “Correlation calculated from faulty data,” British journal of psychology, vol. 3, no. 3, p. 271, 1910
work page 1910
-
[29]
A new measure of rank correlation,
M. G. Kendall, “A new measure of rank correlation,” Biometrika, vol. 30, no. 1–2, pp. 81–93, 1938
work page 1938
-
[30]
Factorial Sampling Plans for Preliminary Computational Experiments,
M. D. Morris, “Factorial Sampling Plans for Preliminary Computational Experiments,” vol. 33, no. 2, 1991
work page 1991
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.