An Inexact Trust-Region Method for Structured Nonsmooth Optimization with Application to Risk-Averse Stochastic Programming
Pith reviewed 2026-05-10 17:16 UTC · model grok-4.3
The pith
An inexact trust-region method minimizes the sum of a smooth function, a nonsmooth convex function, and a support-function composition with global convergence.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper establishes an inexact trust-region method for minimizing the sum of a smooth function, a nonsmooth convex function, and the composition of a finite-valued support function with a smooth function. The algorithm permits inexact evaluations of the objective and its derivatives. Global convergence is proven for the given structure, and under further regularity conditions the iterates are shown to accumulate at a stationary point of the target problem.
What carries the argument
The inexact trust-region method tailored to the sum of smooth, nonsmooth convex, and support-function composition terms while accepting inexact information.
If this is right
- The method converges globally when applied to problems with the stated structure.
- Under the additional regularity assumptions the sequence of iterates accumulates at a stationary point.
- The algorithm remains efficient when applied to PDE-constrained optimization problems and its performance is independent of discretization size.
- The method can be used for risk-averse stochastic programming problems and for subproblems arising in nonsmooth penalty methods.
Where Pith is reading between the lines
- The structure-specific design may allow the same algorithm to serve as a reliable subroutine inside other nonsmooth optimization frameworks.
- Inexactness opens the possibility of solving very large stochastic programs without forming full expectations or gradients at every step.
- The invariance to discretization size suggests the method could be paired with adaptive mesh refinement without retuning algorithmic parameters.
Load-bearing premise
The target problem must possess the specific structure of smooth function plus nonsmooth convex function plus support-function composition and must satisfy the additional regularity conditions needed for accumulation at a stationary point.
What would settle it
A concrete problem with the required structure on which the method fails to produce a globally convergent sequence or on which the iterates do not accumulate at any stationary point.
read the original abstract
We develop a trust-region method for efficiently minimizing the sum of a smooth function, a nonsmooth convex function, and the composition of a finite-valued support function with a smooth function. Optimization problems with this structure arise in numerous applications including risk-averse stochastic programming and subproblems for nonsmooth penalty nonlinear programming methods. Our method permits the use of inexact value and derivative information, enabling the solution of infinite-dimensional problems governed by, e.g., partial differential equations (PDEs). We prove global convergence of our method and under additional regularity assumptions, demonstrate that the sequence of iterates accumulates at a stationary point of our target problem. We demonstrate our method's efficiency on two PDE-constrained optimization examples, showing that its performance is invariant to the PDE discretization size.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript develops an inexact trust-region algorithm for minimizing the sum of a smooth function, a nonsmooth convex function, and the composition of a finite-valued support function with a smooth map. The approach accommodates inexact function and derivative information to handle infinite-dimensional problems such as PDE-constrained risk-averse stochastic programs. Global convergence is proved, and under additional regularity assumptions the iterates are shown to accumulate at a stationary point. Numerical experiments on two PDE examples illustrate mesh-independent performance.
Significance. If the convergence analysis holds, the work supplies a practical and theoretically grounded framework for structured nonsmooth optimization in settings where exact evaluations are unavailable. By exploiting the problem structure to keep subproblems tractable, the method directly addresses computational challenges in PDE-governed applications, and the mesh-independence results strengthen its utility for large-scale discretizations.
major comments (2)
- [§4.2, Theorem 4.3] §4.2, Theorem 4.3: The accumulation-at-stationarity result invokes an error-bound condition whose verification is omitted for the risk-averse stochastic programming examples; without this check the claim that iterates accumulate at stationary points of the target problem remains conditional on unverified assumptions that are load-bearing for the abstract statement.
- [§3.1, Eq. (3.4)] §3.1, Eq. (3.4): The inexactness model for the trust-region subproblem is defined via a relative error tolerance, yet the subsequent global-convergence argument does not quantify how the tolerance must shrink with the trust-region radius to guarantee descent; this gap affects the applicability to PDE discretizations where derivative errors are mesh-dependent.
minor comments (3)
- [Abstract] The abstract states that the method 'permits the use of inexact value and derivative information' but does not preview the precise form of the inexactness model; a single sentence linking to §3.1 would improve readability.
- [Table 2] Table 2: The column headings for iteration counts and CPU times lack units or scaling information; adding explicit mesh-size labels would clarify the mesh-independence claim.
- [§2.2] §2.2: The notation for the support-function composition is introduced without an immediate example; inserting a short illustrative case (e.g., the max-function) would help readers connect the abstract structure to the risk-averse application.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments. We address each major comment below and will make revisions to strengthen the manuscript.
read point-by-point responses
-
Referee: [§4.2, Theorem 4.3] The accumulation-at-stationarity result invokes an error-bound condition whose verification is omitted for the risk-averse stochastic programming examples; without this check the claim that iterates accumulate at stationary points of the target problem remains conditional on unverified assumptions that are load-bearing for the abstract statement.
Authors: We agree that the error-bound condition is essential for the accumulation result in Theorem 4.3. The theorem is already stated under these additional regularity assumptions. For the PDE-constrained risk-averse examples in Section 5, the condition holds due to the strong convexity of the chosen risk measures combined with the smoothness and compactness properties induced by the PDE solutions. To make this explicit, we will add a short paragraph in Section 5 providing the justification for why the error-bound condition is satisfied in these settings, thereby removing any ambiguity about the applicability of the accumulation result. revision: partial
-
Referee: [§3.1, Eq. (3.4)] The inexactness model for the trust-region subproblem is defined via a relative error tolerance, yet the subsequent global-convergence argument does not quantify how the tolerance must shrink with the trust-region radius to guarantee descent; this gap affects the applicability to PDE discretizations where derivative errors are mesh-dependent.
Authors: The global convergence proof in Theorem 4.1 relies on the relative error tolerance ensuring that the inexact model decrease is a positive fraction of the exact model decrease; this is sufficient for the descent lemma without requiring an explicit rate that shrinks proportionally to the trust-region radius. Nevertheless, we acknowledge that an explicit discussion of how the tolerance interacts with mesh-dependent derivative errors would improve applicability to PDE problems. We will therefore insert a clarifying remark immediately following Theorem 4.1 that specifies how the tolerance can be chosen relative to the discretization error while preserving the convergence guarantees, which also explains the observed mesh-independent performance. revision: yes
Circularity Check
No significant circularity; convergence proof is independent of inputs
full rationale
The paper develops a new inexact trust-region algorithm for the given structured problem (smooth + nonsmooth convex + support-function composition) and states that it proves global convergence plus accumulation at a stationary point under additional regularity assumptions. No quoted step reduces the claimed proof to a self-definition, a fitted parameter renamed as prediction, or a load-bearing self-citation chain; the structure is invoked only to keep subproblems tractable, while the convergence argument is presented as a separate, self-contained contribution. Numerical examples are offered as supporting evidence rather than as the basis for the theoretical claims.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Journal of the Royal Statistical Society Series B: Statistical Methodology58(1), 267–288 (1996)
Tibshirani, R.: Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society Series B: Statistical Methodology58(1), 267–288 (1996)
work page 1996
-
[2]
Springer, Berlin, Heidelberg, New York (2006)
Nocedal, J., Wright, S.J.: Numerical Optimization, 2nd edn. Springer, Berlin, Heidelberg, New York (2006)
work page 2006
-
[3]
Advances in Mechanics and Mathematics, vol
Capatina, A.: Variational Inequalities and Frictional Contact Problems. Advances in Mechanics and Mathematics, vol. 31. Springer, Germany (2014)
work page 2014
-
[4]
Artzner, P., Delbaen, F., Eber, J.-M., Heath, D.: Coherent measures of risk. Math. Finance9(3), 203–228 (1999) https://doi.org/10.1111/1467-9965.00068
-
[5]
Springer Series in Operations Research and Financial Engineering Series
Dentcheva, D., Ruszczy´ nski, A.: Risk-Averse Optimization and Control: Theory and Methods. Springer Series in Operations Research and Financial Engineering Series. Springer, Germany (2024)
work page 2024
-
[6]
MOS-SIAM Series on Optimization
Shapiro, A., Dentcheva, D., Ruszczynski, A.: Lectures on Stochastic Program- ming: Modeling and Theory, Second Edition. MOS-SIAM Series on Optimization. Society for Industrial and Applied Mathematics, Philadelphia (2014)
work page 2014
-
[7]
SIAM Journal on Scientific Computing35(4), 1847–1879 (2013) https://doi.org/10.1137/120892362
Kouri, D.P., Heinkenschloss, M., Ridzal, D., Bloemen Waanders, B.G.: A trust- region algorithm with adaptive stochastic collocation for PDE optimization under uncertainty. SIAM Journal on Scientific Computing35(4), 1847–1879 (2013) https://doi.org/10.1137/120892362
-
[8]
SIAM Journal on Scientific Computing36(6), 3011–3029 (2014) https://doi.org/10.1137/140955665
Kouri, D.P., Heinkenschloss, M., Ridzal, D., Bloemen Waanders, B.G.: Inexact objective function evaluations in a trust-region algorithm for PDE-constrained optimization under uncertainty. SIAM Journal on Scientific Computing36(6), 3011–3029 (2014) https://doi.org/10.1137/140955665
-
[9]
(eds.) Optimization of PDEs with Uncertain Inputs, pp
Kouri, D.P., Shapiro, A.: In: Antil, H., Kouri, D.P., Lacasse, M.-D., Ridzal, D. (eds.) Optimization of PDEs with Uncertain Inputs, pp. 41–81. Springer, New York, NY (2018)
work page 2018
-
[10]
SIAM Journal on Optimization26(1), 365–396 (2016) https://doi.org/10.1137/140954556
Kouri, D.P., Surowiec, T.M.: Risk-averse PDE-constrained optimization using the conditional value-at-risk. SIAM Journal on Optimization26(1), 365–396 (2016) https://doi.org/10.1137/140954556
-
[11]
Kouri, D.P., Surowiec, T.M.: Existence and optimality conditions for risk-averse PDE-constrained optimization. SIAM/ASA Journal on Uncertainty Quantifica- tion6(2), 787–815 (2018) https://doi.org/10.1137/16M1086613
-
[12]
Mathematical Programming193(1), 337–363 (2022) 26
Kouri, D.P., Surowiec, T.M.: A primal–dual algorithm for risk minimization. Mathematical Programming193(1), 337–363 (2022) 26
work page 2022
-
[13]
Mathematical Modelling: Theory and Applications, vol
Hinze, M., Pinnau, R., Ulbrich, M., Ulbrich, S.: Optimization with PDE Con- straints. Mathematical Modelling: Theory and Applications, vol. 23, p. 270. Springer, Germany (2009)
work page 2009
-
[14]
Mathematical Programming201(1), 1–40 (2022)
Baraldi, R.J., Kouri, D.P.: A proximal trust-region method for nonsmooth optimization with inexact function and gradient evaluations. Mathematical Programming201(1), 1–40 (2022)
work page 2022
-
[15]
Optimization Letters18(3), 663–680 (2024)
Baraldi, R.J., Kouri, D.P.: Local convergence analysis of an inexact trust-region method for nonsmooth optimization. Optimization Letters18(3), 663–680 (2024)
work page 2024
-
[16]
Classics in Applied Mathematics, Vol
Ekeland, I., Temam, R.: Convex Analysis and Variational Problems. Classics in Applied Mathematics, Vol. 28. SIAM, Philadelphia (1999)
work page 1999
-
[17]
Clarke, F.H.: Nonsmooth Analysis and Control Theory. Graduate Texts in Mathematics. Springer, Germany (1998)
work page 1998
-
[18]
Bauschke, H.H., Combettes, P.L.: Convex Analysis and Monotone Operator Theory in Hilbert Spaces. CMS Books in Mathematics. Springer, Germany (2017)
work page 2017
-
[19]
Computational Optimization and applications62(3), 669–692 (2015)
Chen, M., Mehrotra, S., Papp, D.: Scenario generation for stochastic optimiza- tion problems via the sparse grid method. Computational Optimization and applications62(3), 669–692 (2015)
work page 2015
-
[20]
Acta Numerica34, 491–577 (2025)
Heinkenschloss, M., Kouri, D.P.: Optimization problems governed by systems of pdes with uncertainties. Acta Numerica34, 491–577 (2025)
work page 2025
-
[21]
Conn, A.R., Gould, N.I.M., Toint, P.L.: Trust–Region Methods. SIAM, Philadel- phia (2000)
work page 2000
-
[22]
SIAM Journal on Optimization9(4), 1100–1127 (1999)
Lin, C.-J., Mor´ e, J.J.: Newton’s method for large bound-constrained optimization problems. SIAM Journal on Optimization9(4), 1100–1127 (1999)
work page 1999
-
[23]
Computational Optimization and Applications, (2024)
Baraldi, R.J., Kouri, D.P.: Efficient subproblem solvers for a nonsmooth trust- region method. Computational Optimization and Applications, (2024)
work page 2024
- [24]
-
[25]
SIAM Journal on Optimization10(4), 1196–1211 (2000)
Birgin, E.G., Mart´ ınez, J.M., Raydan, M.: Nonmonotone spectral projected gra- dient methods on convex sets. SIAM Journal on Optimization10(4), 1196–1211 (2000)
work page 2000
-
[26]
Numerical Functional Analysis and Optimization37(2), 129–144 (2016)
Bello Cruz, J.: On weak and strong convergence of the projected gradient method for convex optimization in real hilbert spaces. Numerical Functional Analysis and Optimization37(2), 129–144 (2016)
work page 2016
-
[27]
Salzo, S., Villa, S.,et al.: Inexact and accelerated proximal point algorithms. J. Convex Anal19(4), 1167–1192 (2012) 27
work page 2012
-
[28]
SIAM Journal on Optimization23(3), 1607–1633 (2013) https://doi.org/10.1137/110844805
Villa, S., Salzo, S., Baldassarre, L., Verri, A.: Accelerated and inexact forward- backward algorithms. SIAM Journal on Optimization23(3), 1607–1633 (2013) https://doi.org/10.1137/110844805
-
[29]
Springer, Berlin, Heidelberg, New York (2000)
Bonnans, J.F., Shapiro, A.: Perturbation Analysis of Optimization Problems. Springer, Berlin, Heidelberg, New York (2000)
work page 2000
-
[30]
De Gruyter Series in Nonlinear Analysis and Applications
Emelyanov, S.V., Korovin, S.K., Bobylev, N.A., Bulatov, A.V.: Homotopy of Extremal Problems: Theory and Applications. De Gruyter Series in Nonlinear Analysis and Applications. De Gruyter, Germany (2011)
work page 2011
-
[31]
SIAM Journal on Optimization2(1), 121–152 (1992) https://doi.org/10.1137/0802008
Schramm, H., Zowe, J.: A version of the bundle idea for minimizing a nonsmooth function: Conceptual idea, convergence analysis, numerical results. SIAM Journal on Optimization2(1), 121–152 (1992) https://doi.org/10.1137/0802008
-
[32]
ESAIM: Control, Optimisation and Calculus of Variations26, 53 (2020) 28
Kouri, D., Surowiec, T.: Risk-averse optimal control of semilinear elliptic pdes. ESAIM: Control, Optimisation and Calculus of Variations26, 53 (2020) 28
work page 2020
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.