pith. sign in

arxiv: 2507.09503 · v4 · submitted 2025-07-13 · 📡 eess.SY · cs.LG· cs.SY

Neural Two-Stage Stochastic Optimization for Solving Unit Commitment Problem

Pith reviewed 2026-05-19 05:23 UTC · model grok-4.3

classification 📡 eess.SY cs.LGcs.SY
keywords neural network approximationtwo-stage stochastic optimizationunit commitmentrecourse functionmixed-integer linear programmingscenario reductionpower system scheduling
0
0 comments X

The pith

A deep neural network approximates recourse costs in two-stage stochastic unit commitment so the first-stage problem can be solved directly as a mixed-integer linear program.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows how to train a neural network to predict the cost of adjusting generation after the first-stage commitment decisions are fixed and uncertainty is revealed. This trained network is then inserted as linear constraints inside the original unit-commitment optimization, turning the whole stochastic problem into a single, solvable MILP. A second network compresses any number of uncertainty scenarios into a fixed-size feature vector, so the size of the final model does not grow with the number of scenarios. Experiments on standard IEEE test systems report solutions whose total cost is within 1 percent of the optimum while running orders of magnitude faster than conventional solvers.

Core claim

The method trains a deep neural network to map first-stage commitment decisions together with uncertainty features directly to the optimal recourse cost of the second-stage problem; once trained, this network is encoded as a set of mixed-integer linear constraints and inserted into the first-stage unit-commitment formulation, while a separate scenario-embedding network reduces an arbitrary collection of uncertainty realizations to a compact feature vector that keeps the overall model size independent of the scenario count.

What carries the argument

Deep neural network that maps commitment decisions and uncertainty features to recourse costs, encoded as MILP constraints and combined with a scenario-embedding network for dimensionality reduction.

If this is right

  • The final optimization model stays the same size no matter how many uncertainty scenarios are considered.
  • Operational constraints such as ramping limits and transmission capacities remain explicitly enforced inside the first-stage MILP.
  • Computation time scales far better than scenario-based decomposition or full MILP formulations on the IEEE 5-, 30-, and 118-bus systems.
  • The same trained network can be reused across different numbers of scenarios without retraining.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same embedding technique could be tried on other two-stage stochastic programs that have expensive recourse subproblems.
  • If the network approximation error can be bounded, the method might give provable guarantees on solution quality rather than only empirical gaps.
  • Real-time market clearing with hundreds of renewable scenarios becomes feasible if the network training is performed offline.

Load-bearing premise

The trained network must predict recourse costs accurately enough on new scenarios that the resulting first-stage solution stays close to optimal and satisfies all operational limits.

What would settle it

Run the trained model on a new set of scenarios or a larger test system and measure whether the obtained first-stage solution violates any operational constraint or produces a total expected cost more than 1 percent above the true optimum.

Figures

Figures reproduced from arXiv: 2507.09503 by Jingtao Qin, Nanpeng Yu, Zhentong Shao.

Figure 1
Figure 1. Figure 1: Architecture diagram of the proposed neural network. [PITH_FULL_IMAGE:figures/full_fig_p005_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Optimality gap evolution over time for the 118-bus system with 100 [PITH_FULL_IMAGE:figures/full_fig_p009_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: The MIP gap distribution of the 118-bus system under 100 stochastic [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗
read the original abstract

This paper proposes a neural stochastic optimization method for efficiently solving the two-stage stochastic unit commitment (2S-SUC) problem under high-dimensional uncertainty scenarios. The proposed method approximates the second-stage recourse problem using a deep neural network trained to map commitment decisions and uncertainty features to recourse costs. The trained network is subsequently embedded into the first-stage UC problem as a mixed-integer linear program (MILP), allowing for explicit enforcement of operational constraints while preserving the key uncertainty characteristics. A scenario-embedding network is employed to enable dimensionality reduction and feature aggregation across arbitrary scenario sets, serving as a data-driven scenario reduction mechanism. Numerical experiments on IEEE 5-bus, 30-bus, and 118-bus systems demonstrate that the proposed neural two-stage stochastic optimization method achieves solutions with an optimality gap of less than 1%, while enabling orders-of-magnitude speedup compared to conventional MILP solvers and decomposition-based methods. Moreover, the model's size remains constant regardless of the number of scenarios, offering significant scalability for large-scale stochastic unit commitment problems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper proposes a neural two-stage stochastic optimization method for the two-stage stochastic unit commitment (2S-SUC) problem. A deep neural network approximates the second-stage recourse costs from commitment decisions and uncertainty features; this network is encoded as MILP constraints and embedded in the first-stage problem. A scenario-embedding network performs data-driven dimensionality reduction across arbitrary scenario sets. Experiments on IEEE 5-, 30-, and 118-bus systems report optimality gaps below 1 % and orders-of-magnitude speedups relative to MILP solvers and decomposition methods, with model size independent of scenario count.

Significance. If the central performance claims are substantiated by exact out-of-sample verification, the approach would offer a scalable, constant-complexity surrogate for large-scale stochastic unit commitment that preserves operational constraints while achieving near-optimal first-stage decisions. This could materially advance real-time stochastic scheduling under high-dimensional uncertainty.

major comments (2)
  1. [Numerical Experiments] Numerical Experiments section: the reported optimality gaps below 1 % are obtained by solving the first-stage problem with the embedded neural surrogate; the manuscript does not state whether the true expected cost of the resulting commitment schedule was recomputed by solving the exact second-stage recourse problems on held-out scenarios. Without this verification, the gap figures may reflect surrogate objective values rather than true optimality gaps, directly undermining the central performance claim.
  2. [Method] Method section (neural recourse approximation and MILP embedding): no explicit approximation-error bounds, worst-case analysis of the ReLU/MILP encoding, or empirical feasibility checks on unseen scenario sets are provided. The central claim that the embedded network yields operationally feasible and near-optimal first-stage decisions therefore rests on unverified assumptions about surrogate accuracy.
minor comments (1)
  1. [Abstract] Abstract: the statement that 'the model's size remains constant regardless of the number of scenarios' should clarify whether this refers to the number of neural-network parameters or to the size of the resulting MILP.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the constructive and insightful comments. We address each major comment below, clarifying our experimental verification procedures and indicating where revisions will be made to strengthen the presentation of results and methods.

read point-by-point responses
  1. Referee: [Numerical Experiments] Numerical Experiments section: the reported optimality gaps below 1 % are obtained by solving the first-stage problem with the embedded neural surrogate; the manuscript does not state whether the true expected cost of the resulting commitment schedule was recomputed by solving the exact second-stage recourse problems on held-out scenarios. Without this verification, the gap figures may reflect surrogate objective values rather than true optimality gaps, directly undermining the central performance claim.

    Authors: We agree that explicit out-of-sample verification using exact second-stage solutions is necessary to substantiate the optimality gap claims. In our experiments, after obtaining first-stage commitment decisions from the MILP with the embedded neural surrogate, we recomputed the true expected recourse cost by solving the exact second-stage problems on held-out scenarios that were not used during training or validation. The reported gaps below 1% are with respect to these true costs. We will revise the Numerical Experiments section to explicitly describe this verification procedure, including details on the held-out scenario sets and how the true optimality gaps were calculated. revision: yes

  2. Referee: [Method] Method section (neural recourse approximation and MILP embedding): no explicit approximation-error bounds, worst-case analysis of the ReLU/MILP encoding, or empirical feasibility checks on unseen scenario sets are provided. The central claim that the embedded network yields operationally feasible and near-optimal first-stage decisions therefore rests on unverified assumptions about surrogate accuracy.

    Authors: We acknowledge that the manuscript does not include theoretical approximation-error bounds or a worst-case analysis of the ReLU/MILP encoding. Such bounds are difficult to obtain for general neural approximations of recourse functions and are outside the scope of this primarily empirical study. However, we provide extensive numerical validation on multiple IEEE test systems using unseen scenarios. To strengthen the manuscript, we will add a dedicated subsection reporting empirical feasibility checks on unseen scenario sets, including the percentage of feasible first-stage solutions obtained and any observed constraint violations. Operational feasibility of first-stage decisions is enforced directly by the MILP formulation of the unit commitment constraints. revision: partial

standing simulated objections not resolved
  • Derivation of explicit approximation-error bounds or worst-case analysis for the neural recourse approximation and its ReLU/MILP encoding.

Circularity Check

0 steps flagged

No circularity: empirical validation on standard test systems is independent of fitted surrogate

full rationale

The paper describes training a DNN to approximate recourse costs from commitment decisions and uncertainty features, embedding the trained network (plus scenario-embedding subnetwork) as MILP constraints in the first-stage UC problem, and then reporting numerical optimality gaps and runtimes on IEEE 5/30/118-bus instances against MILP solvers and decomposition methods. These performance numbers are obtained by solving the surrogate-augmented first-stage problem and comparing its objective and feasibility to reference solutions; they do not reduce by construction to the training data or to any self-citation chain. The method is a standard data-driven surrogate approach whose central claims rest on external benchmark comparisons rather than on any redefinition of inputs as outputs.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 1 invented entities

The central claim rests on the accuracy of a data-driven neural approximation and on the assumption that this approximation can be faithfully converted into linear constraints without destroying feasibility or optimality properties.

free parameters (1)
  • Neural network weights and biases
    Trained on simulated recourse data to map commitment decisions and uncertainty features to costs.
axioms (1)
  • domain assumption A trained feed-forward network can be exactly or approximately encoded as a set of mixed-integer linear constraints
    Invoked when the trained network is embedded into the first-stage MILP.
invented entities (1)
  • Scenario-embedding network no independent evidence
    purpose: Dimensionality reduction and feature aggregation across arbitrary scenario sets
    Introduced as a data-driven scenario reduction mechanism.

pith-pipeline@v0.9.0 · 5711 in / 1391 out tokens · 78614 ms · 2026-05-19T05:23:48.404988+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

33 extracted references · 33 canonical work pages · 1 internal anchor

  1. [1]

    Stochastic security-constrained unit commitment,

    L. Wu, M. Shahidehpour, and T. Li, “Stochastic security-constrained unit commitment,” IEEE Trans. Power Syst. , vol. 22, no. 2, pp. 800– 811, 2007

  2. [2]

    Frauendorfer, Stochastic two-stage programming

    K. Frauendorfer, Stochastic two-stage programming. Springer Science & Business Media, 2012, vol. 392

  3. [3]

    Network-constrained AC unit com- mitment under uncertainty: A Benders’ decomposition approach,

    A. Nasri, S. J. Kazempour et al. , “Network-constrained AC unit com- mitment under uncertainty: A Benders’ decomposition approach,” IEEE Trans. Power Syst., vol. 31, no. 1, pp. 412–422, 2015

  4. [4]

    Solution sensitivity-based scenario reduction for stochastic unit commitment,

    Y . Feng and S. M. Ryan, “Solution sensitivity-based scenario reduction for stochastic unit commitment,” Comput. Manag. Sci., vol. 13, pp. 29– 62, 2016

  5. [5]

    Stochastic optimization for unit commitment—A review,

    Q. P. Zheng, J. Wang, and A. L. Liu, “Stochastic optimization for unit commitment—A review,” IEEE Trans. Power Syst. , vol. 30, no. 4, pp. 1913–1924, 2014

  6. [6]

    Transmission constrained UC with wind power: An all-scenario-feasible MILP formulation with strong nonanticipativity,

    Q. Zhai, X. Li, X. Lei, and X. Guan, “Transmission constrained UC with wind power: An all-scenario-feasible MILP formulation with strong nonanticipativity,” IEEE Trans. Power Syst. , vol. 32, no. 3, pp. 1805– 1817, 2016

  7. [7]

    Chance-constrained two-stage unit commitment under uncertain load and wind power output using bilinear Benders decomposition,

    Y . Zhang, J. Wang, B. Zeng, and Z. Hu, “Chance-constrained two-stage unit commitment under uncertain load and wind power output using bilinear Benders decomposition,” IEEE Trans. Power Syst. , vol. 32, no. 5, pp. 3637–3647, 2017

  8. [8]

    Column-and-constraint generation approach to partition-based risk-averse two-stage stochastic programs,

    J. Lee and K. Lee, “Column-and-constraint generation approach to partition-based risk-averse two-stage stochastic programs,” Ann. Oper. Res., pp. 1–31, 2025

  9. [9]

    Data-driven stochastic unit commitment for integrating wind generation,

    C. Zhao and Y . Guan, “Data-driven stochastic unit commitment for integrating wind generation,” IEEE Trans. Power Syst. , vol. 31, no. 4, pp. 2587–2596, 2015

  10. [10]

    Scenario map based stochastic unit commitment,

    E. Du, N. Zhang, C. Kang, and Q. Xia, “Scenario map based stochastic unit commitment,” IEEE Trans. Power Syst. , vol. 33, no. 5, pp. 4694– 4705, 2018

  11. [11]

    An efficient robust solution to the two- stage stochastic unit commitment problem,

    I. Blanco and J. M. Morales, “An efficient robust solution to the two- stage stochastic unit commitment problem,” IEEE Trans. Power Syst. , vol. 32, no. 6, pp. 4477–4488, 2017

  12. [12]

    Stochastic distributionally robust unit commitment with deep scenario clustering,

    J. Zhang, B. Wang, and J. Watada, “Stochastic distributionally robust unit commitment with deep scenario clustering,”Electr. Power Syst. Res., vol. 224, p. 109710, 2023

  13. [13]

    Comparison of scenario reduction techniques for the stochastic unit commitment,

    Y . Dvorkin, Y . Wang, H. Pandzic, and D. Kirschen, “Comparison of scenario reduction techniques for the stochastic unit commitment,” in 2014 IEEE PES general meeting , pp. 1–5

  14. [14]

    A predictive prescription framework for stochastic unit commitment using boosting ensemble learning algo- rithms,

    O. Yurdakul, F. Qiu et al. , “A predictive prescription framework for stochastic unit commitment using boosting ensemble learning algo- rithms,” IEEE Trans. Power Syst., vol. 39, no. 4, pp. 5726–5740, 2023

  15. [15]

    Towards improving unit commitment eco- nomics: An add-on tailor for renewable energy and reserve predictions,

    X. Chen, Y . Liu, and L. Wu, “Towards improving unit commitment eco- nomics: An add-on tailor for renewable energy and reserve predictions,” IEEE Trans. Sustain. Energy , vol. 15, no. 4, pp. 2547–2566, 2024

  16. [16]

    Physics-informed graph neural networks for collaborative dynamic reconfiguration and voltage regulation in unbalanced distribution systems,

    J. Qin, R. Yang, and N. Yu, “Physics-informed graph neural networks for collaborative dynamic reconfiguration and voltage regulation in unbalanced distribution systems,” IEEE Trans. Ind. Appl., vol. 61, no. 2, pp. 2538–2548, 2025

  17. [17]

    A reinforcement learning embedded sur- rogate Lagrangian relaxation method for fast solving unit commitment problems,

    Y . Zhu, G. Cui, A. Liu et al., “A reinforcement learning embedded sur- rogate Lagrangian relaxation method for fast solving unit commitment problems,” IEEE Trans. Power Syst. , 2025

  18. [18]

    Extended Benders decomposition for two-stage SCUC,

    C. Liu, M. Shahidehpour, and L. Wu, “Extended Benders decomposition for two-stage SCUC,” IEEE Trans. Power Syst., vol. 25, no. 2, pp. 1192– 1194, 2010

  19. [19]

    A decomposition approach to the two-stage stochastic unit commitment problem,

    Q. P. Zheng, J. Wang, P. M. Pardalos, and Y . Guan, “A decomposition approach to the two-stage stochastic unit commitment problem,” Ann. Oper. Res., vol. 210, pp. 387–410, 2013

  20. [20]

    Maximum resilience of artificial neural networks,

    C.-H. Cheng, G. N ¨uhrenberg, and H. Ruess, “Maximum resilience of artificial neural networks,” in Automated Technology for Verification and Analysis: 15th International Symposium . Springer, 2017, pp. 251–268

  21. [21]

    Evaluating Robustness of Neural Networks with Mixed Integer Programming

    V . Tjeng, K. Xiao, and R. Tedrake, “Evaluating robustness of neural networks with mixed integer programming,” arXiv preprint arXiv:1711.07356, 2017

  22. [22]

    Deep neural networks and mixed integer linear optimization,

    M. Fischetti and J. Jo, “Deep neural networks and mixed integer linear optimization,” Constraints, vol. 23, no. 3, pp. 296–309, 2018

  23. [23]

    ReLU networks as surrogate models in mixed-integer linear programs,

    B. Grimstad and H. Andersson, “ReLU networks as surrogate models in mixed-integer linear programs,” Comput. Chem. Eng. , vol. 131, p. 106580, 2019

  24. [24]

    Strong mixed-integer programming formulations for trained neural networks,

    R. Anderson, J. Huchette, W. Ma, C. Tjandraatmadja, and J. P. Vielma, “Strong mixed-integer programming formulations for trained neural networks,” Math. Program., vol. 183, no. 1, pp. 3–39, 2020

  25. [25]

    OMLT: Optimization & Machine Learning Toolkit,

    F. Ceccon, J. Jalving, J. Haddad, A. Thebelt, C. Tsay, C. D. Laird, and R. Misener, “OMLT: Optimization & Machine Learning Toolkit,” J. Mach. Learn. Res. , vol. 23, no. 349, pp. 1–8, 2022

  26. [26]

    Formulating data-driven surrogate models for process optimization,

    R. Misener and L. Biegler, “Formulating data-driven surrogate models for process optimization,” Comput. Chem. Eng. , vol. 179, p. 108411, 2023

  27. [27]

    Beyond price taker: Conceptual design and optimization of integrated energy systems using machine learning market surrogates,

    J. Jalving, J. Ghouse et al. , “Beyond price taker: Conceptual design and optimization of integrated energy systems using machine learning market surrogates,” Appl. Energy, vol. 351, p. 121767, 2023

  28. [28]

    Optimal energy system scheduling using a constraint-aware reinforcement learning algorithm,

    H. Shengren, P. P. Vergara et al. , “Optimal energy system scheduling using a constraint-aware reinforcement learning algorithm,”Int. J. Electr. Power Energy Syst., vol. 152, p. 109230, 2023

  29. [29]

    Neur2SP: Neural two-stage stochastic programming,

    R. M. Patel, J. Dumouchelle, E. Khalil, and M. Bodur, “Neur2SP: Neural two-stage stochastic programming,” Adv. Neural Inf. Process. Syst., vol. 35, pp. 23 992–24 005, 2022

  30. [30]

    A comprehensive survey on design and application of autoencoder in deep learning,

    P. Li, Y . Pei, and J. Li, “A comprehensive survey on design and application of autoencoder in deep learning,” Appl. Soft Comput. , vol. 138, p. 110176, 2023

  31. [31]

    MATPOWER: Steady- state operations, planning, and analysis tools for power systems research and education,

    R. D. Zimmerman, C. E. Murillo-S ´anchez et al., “MATPOWER: Steady- state operations, planning, and analysis tools for power systems research and education,”IEEE Trans. Power Syst., vol. 26, no. 1, pp. 12–19, 2010

  32. [32]

    Pyomo: modeling and solving mathematical programs in Python,

    W. E. Hart, J.-P. Watson, and D. L. Woodruff, “Pyomo: modeling and solving mathematical programs in Python,” Math. Program. Comput. , vol. 3, pp. 219–260, 2011

  33. [33]

    Solving two-stage robust optimization problems using a column-and-constraint generation method,

    B. Zeng and L. Zhao, “Solving two-stage robust optimization problems using a column-and-constraint generation method,” Oper. Res. Lett. , vol. 41, no. 5, pp. 457–461, 2013