Recognition: unknown
On the Connectedness of Sublevel Sets in Invex Optimization
Pith reviewed 2026-05-10 15:08 UTC · model grok-4.3
The pith
Sublevel sets of invex functions are connected under mild assumptions, aiding global optimization.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The sublevel sets of invex functions are connected under mild assumptions. This is shown with a mathematical toolkit based on the topological mountain pass theorem. The result is further used to establish connectedness of solution sets for invex-incave minimax problems and incave games.
What carries the argument
The topological mountain pass theorem toolkit, which proves connectedness of sublevel sets for invex functions.
If this is right
- Local search algorithms become less likely to be trapped in isolated valleys.
- Convergence to global minimizers is facilitated for invex functions.
- The solution sets of invex-incave minimax problems are connected.
- The solution sets of incave games are connected.
Where Pith is reading between the lines
- The result could lead to improved global convergence guarantees for optimization methods on invex functions.
- Similar toolkits might be developed for other classes of functions beyond invex ones.
- Practical verification could involve checking the connectedness on specific invex functions arising in applications.
Load-bearing premise
Invex functions must satisfy the mild assumptions that enable the application of the mountain pass theorem toolkit.
What would settle it
An invex function that meets the mild assumptions but has a disconnected sublevel set for some value would show the claim is false.
Figures
read the original abstract
Understanding the topology of sublevel sets yields crucial insights into the optimization landscape of non-convex functions. If sublevel sets are connected, local search algorithms are less likely to be trapped in isolated valleys, facilitating convergence to global minimizers. However, few results exist to establish connectedness in the nonconvex setting. In this work, we present a mathematical toolkit based on the topological mountain pass theorem and use it to study invex functions, a class of functions that includes those satisfying the Polyak-{\L}ojasiewicz inequality and generalizations thereof. We show that their sublevel sets are connected under mild assumptions. We further leverage our result to establish the connectedness of different solution sets for invex-incave minimax problems and incave games.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript develops a toolkit based on the topological mountain pass theorem to prove that sublevel sets of invex functions (including those satisfying the Polyak-Łojasiewicz inequality) are connected under mild assumptions. It further applies the result to establish connectedness of solution sets for invex-incave minimax problems and incave games.
Significance. If rigorously established, the result would offer a useful topological characterization of optimization landscapes for a wide class of non-convex functions, potentially explaining why local search methods can reach global minimizers without being trapped in disconnected valleys.
major comments (2)
- [Main theorem and its proof (Section 3)] The proof of the main connectedness result proceeds by contradiction: a disconnected sublevel set {f ≤ c} (c > inf f) would produce, via the mountain-pass theorem, a critical point at some level d > inf f. Invexity then forces every critical point to be a global minimizer, yielding the desired contradiction. However, standard statements of the mountain-pass theorem (including the version invoked in the manuscript) require a compactness condition such as Palais-Smale at the mountain-pass level. The paper states only that the functions are invex and satisfy “mild assumptions” for the toolkit; it neither proves that invexity implies Palais-Smale nor supplies verifiable conditions under which the compactness holds uniformly for the class. This assumption is load-bearing for the contradiction step.
- [Introduction and statement of main results (Section 1)] The “mild assumptions” required for the mountain-pass toolkit are never listed explicitly or shown to be satisfied by the invex functions under consideration. Without a precise statement of these assumptions (e.g., a concrete Palais-Smale-type condition or a coercivity requirement), it is impossible to determine the precise scope of the claimed connectedness result.
minor comments (2)
- [Abstract] The abstract refers to “mild assumptions” without any indication of what they are; a brief parenthetical or forward reference to the precise hypotheses would improve readability.
- [Section 2] Notation for the mountain-pass level and the critical-value set is introduced without a dedicated preliminary subsection; a short “Notation and preliminaries” paragraph would help readers track the topological arguments.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive report. The comments highlight important points regarding the explicit statement of assumptions in our use of the mountain-pass theorem. We will revise the manuscript to address these concerns by providing a clear enumeration of the mild assumptions and discussing conditions under which they hold for invex functions.
read point-by-point responses
-
Referee: [Main theorem and its proof (Section 3)] The proof of the main connectedness result proceeds by contradiction: a disconnected sublevel set {f ≤ c} (c > inf f) would produce, via the mountain-pass theorem, a critical point at some level d > inf f. Invexity then forces every critical point to be a global minimizer, yielding the desired contradiction. However, standard statements of the mountain-pass theorem (including the version invoked in the manuscript) require a compactness condition such as Palais-Smale at the mountain-pass level. The paper states only that the functions are invex and satisfy “mild assumptions” for the toolkit; it neither proves that invexity implies Palais-Smale nor supplies verifiable conditions under which the compactness holds uniformly for the class. This assumption is load-bearing for the contradiction step.
Authors: We agree that the Palais-Smale condition is essential for the mountain-pass theorem and is implicitly included among the mild assumptions. In the revised manuscript, we will explicitly list all assumptions in a new paragraph in Section 1 and restate them at the beginning of Section 3. We will clarify that the functions are assumed to satisfy a Palais-Smale-type condition at levels strictly above the infimum. We will also add a remark providing verifiable sufficient conditions (e.g., coercivity on reflexive Banach spaces or the Polyak-Łojasiewicz inequality together with C^1 regularity) under which invex functions satisfy this compactness requirement. This makes the scope of the result precise while preserving the proof structure. revision: yes
-
Referee: [Introduction and statement of main results (Section 1)] The “mild assumptions” required for the mountain-pass toolkit are never listed explicitly or shown to be satisfied by the invex functions under consideration. Without a precise statement of these assumptions (e.g., a concrete Palais-Smale-type condition or a coercivity requirement), it is impossible to determine the precise scope of the claimed connectedness result.
Authors: We accept this criticism. The revised introduction will contain an explicit bullet-point list of the mild assumptions, including the Palais-Smale condition at mountain-pass levels and any auxiliary topological requirements. We will also include a short discussion of concrete classes (such as invex functions satisfying the Polyak-Łojasiewicz inequality) where these assumptions are known to hold, thereby delineating the applicability of the connectedness theorem. revision: yes
Circularity Check
No significant circularity; derivation applies external theorem to invex definition
full rationale
The derivation proceeds by applying the topological mountain-pass theorem (an external result) to the standard property of invex functions that every critical point is a global minimizer. A contradiction argument then shows that sublevel sets above the infimum cannot be disconnected. This chain depends on the definition of invexity plus the stated mild assumptions needed for the mountain-pass toolkit; it does not reduce the claimed connectedness result to a self-definition, a fitted parameter renamed as prediction, or a load-bearing self-citation whose content is itself unverified. The assumptions are treated as prerequisites rather than derived from the conclusion, and no ansatz or renaming of known results occurs. The derivation is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Invex functions include those satisfying the Polyak-Lojasiewicz inequality and generalizations thereof
- standard math The topological mountain pass theorem can be applied to establish connectedness of sublevel sets
Reference graph
Works this paper leans on
-
[1]
Invex programs: First order algorithms and their convergence, 2023
Adarsh Barik, Suvrit Sra, and Jean Honorio. Invex programs: First order algorithms and their convergence, 2023
2023
-
[2]
B. Douglas Bernheim. Rationalizable strategic behavior. Econometrica, 52 0 (4): 0 1007, July 1984. ISSN 0012-9682. doi:10.2307/1911196
-
[3]
Bertsekas
Dimitri P. Bertsekas. Convex optimization theory. Athena Scientific optimization and computation series. Athena Scientific, Belmont, Mass, 5th print. edition, 2014. ISBN 9781886529311
2014
-
[4]
Global minima of overparameterized neural networks
Yaim Cooper. Global minima of overparameterized neural networks. SIAM Journal on Mathematics of Data Science, 3 0 (2): 0 676--691, January 2021. ISSN 2577-0187. doi:10.1137/19m1308943
-
[5]
If a smooth function is globally PŁ and coercive, then it has a unique minimizer, 2025
Chris Criscitiello, Quentin Rebjock, and Nicolas Boumal. If a smooth function is globally PŁ and coercive, then it has a unique minimizer, 2025. URL www.racetothebottom.xyz/posts/PL-smooth-unique/
2025
-
[6]
Asen L. Dontchev and R. Tyrrell Rockafellar. Implicit Functions and Solution Mappings: A View from Variational Analysis. Springer New York, 2014. ISBN 9781493910373. doi:10.1007/978-1-4939-1037-3
-
[7]
Essentially no barriers in neural network energy landscape
Felix Draxler, Kambis Veschgini, Manfred Salmhofer, and Fred Hamprecht. Essentially no barriers in neural network energy landscape. In Jennifer Dy and Andreas Krause, editors, Proceedings of the 35th International Conference on Machine Learning, volume 80 of Proceedings of Machine Learning Research, pages 1309--1318. PMLR, 10--15 Jul 2018. URL https://pro...
2018
-
[8]
e Silva and Marco Antonio Teixeira
Elves Alves B. e Silva and Marco Antonio Teixeira. A version of rolle's theorem and applications. Boletim da Sociedade Brasileira de Matem \'a tica - Bulletin/Brazilian Mathematical Society , 29: 0 301--327, 1998. URL https://api.semanticscholar.org/CorpusID:123019170
1998
-
[9]
Optimizing static linear feedback: Gradient method
Ilyas Fatkhullin and Boris Polyak. Optimizing static linear feedback: Gradient method. SIAM Journal on Control and Optimization, 59 0 (5): 0 3887--3911, January 2021. ISSN 1095-7138. doi:10.1137/20m1329858
-
[10]
Loss surfaces, mode connectivity, and fast ensembling of dnns
Timur Garipov, Pavel Izmailov, Dmitrii Podoprikhin, Dmitry P Vetrov, and Andrew G Wilson. Loss surfaces, mode connectivity, and fast ensembling of dnns. In S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 31. Curran Associates, Inc., 2018. URL https://proc...
2018
-
[11]
arXiv preprint arXiv:2501.00429 , year=
Yun Gong, Niao He, and Zebang Shen. Poincare inequality for local log-polyak- ojasiewicz measures: Non-asymptotic analysis in low-temperature regime, 2025. URL https://arxiv.org/abs/2501.00429
-
[12]
Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio
Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial nets. In Z. Ghahramani, M. Welling, C. Cortes, N. Lawrence, and K.Q. Weinberger, editors, Advances in Neural Information Processing Systems, volume 27. Curran Associates, Inc., 2014. URL https://proceed...
2014
-
[13]
On sufficiency of the kuhn-tucker conditions
Morgan A Hanson. On sufficiency of the kuhn-tucker conditions. Journal of Mathematical Analysis and Applications, 80 0 (2): 0 545--550, April 1981. ISSN 0022-247X. doi:10.1016/0022-247x(81)90123-2
-
[14]
Garrett Hardin. The tragedy of the commons: The population problem has no technical solution; it requires a fundamental extension in morality. Science, 162 0 (3859): 0 1243--1248, December 1968. ISSN 1095-9203. doi:10.1126/science.162.3859.1243
-
[15]
Notes on introductory point-set topology
Allen Hatcher. Notes on introductory point-set topology. Lecture notes available online, 2005. URL https://pi.math.cornell.edu/ hatcher/Top/TopNotes.pdf
2005
-
[16]
E. Heine. Die elemente der functionenlehre. Journal für die reine und angewandte Mathematik, 74: 0 172--188, 1872. URL http://eudml.org/doc/148175
-
[17]
Linear convergence of gradient and proximal-gradient methods under the polyak- ojasiewicz condition
Hamed Karimi, Julie Nutini, and Mark Schmidt. Linear convergence of gradient and proximal-gradient methods under the polyak- ojasiewicz condition. In Paolo Frasconi, Niels Landwehr, Giuseppe Manco, and Jilles Vreeken, editors, Machine Learning and Knowledge Discovery in Databases, pages 795--811, Cham, 2016. Springer International Publishing. ISBN 978-3-3...
2016
-
[18]
Mountain pass theorems and global homeomorphism theorems
Guy Katriel. Mountain pass theorems and global homeomorphism theorems. Annales de l'I.H.P. Analyse non lin\'eaire, 11 0 (2): 0 189--209, 1994. URL http://www.numdam.org/item/AIHPC_1994__11_2_189_0/
1994
-
[19]
Loss landscapes and optimization in over-parameterized non-linear systems and neural networks
Chaoyue Liu, Libin Zhu, and Mikhail Belkin. Loss landscapes and optimization in over-parameterized non-linear systems and neural networks. Applied and Computational Harmonic Analysis, 59: 0 85--116, July 2022. ISSN 1063-5203. doi:10.1016/j.acha.2021.12.009
-
[20]
arXiv preprint arXiv:2505.05991 , year=
Saeed Masiha, Zebang Shen, Negar Kiyavash, and Niao He. Superquantile-gibbs relaxation for minima-selection in bi-level optimization, 2025. URL https://arxiv.org/abs/2505.05991
-
[21]
On the global convergence rates of softmax policy gradient methods
Jincheng Mei, Chenjun Xiao, Csaba Szepesvari, and Dale Schuurmans. On the global convergence rates of softmax policy gradient methods. In Hal Daumé III and Aarti Singh, editors, Proceedings of the 37th International Conference on Machine Learning, volume 119 of Proceedings of Machine Learning Research, pages 6820--6829. PMLR, 13--18 Jul 2020. URL https://...
2020
-
[22]
Learning in games with continuous action sets and unknown payoff functions
Panayotis Mertikopoulos and Zhengyuan Zhou. Learning in games with continuous action sets and unknown payoff functions. Mathematical Programming, 173 0 (1–2): 0 465--507, March 2018. ISSN 1436-4646. doi:10.1007/s10107-018-1254-8
-
[23]
Shashi Kant Mishra and Giorgio Giorgi. Invexity and Optimization. Springer Berlin Heidelberg, 2008. ISBN 9783540785620. doi:10.1007/978-3-540-78562-0
-
[24]
Dov Monderer and Lloyd S. Shapley. Potential games. Games and Economic Behavior, 14 0 (1): 0 124--143, May 1996. ISSN 0899-8256. doi:10.1006/game.1996.0044
-
[25]
James R. Munkres. Topology. Always learning. Pearson, Harlow, Essex, second, pearson new international edition. edition, 2014. ISBN 1-292-03678-8
2014
-
[26]
Rosemarie Nagel. Unraveling in guessing games: An experimental study. The American Economic Review, 85 0 (5): 0 1313--1326, 1995. ISSN 00028282. URL http://www.jstor.org/stable/2950991
-
[27]
John Nash. Non-cooperative games. The Annals of Mathematics, 54 0 (2): 0 286, sep 1951. doi:10.2307/1969529
-
[28]
Learning invariances with generalised input-convex neural networks, 2022
Vitali Nesterov, Fabricio Arend Torres, Monika Nagy-Huber, Maxim Samarin, and Volker Roth. Learning invariances with generalised input-convex neural networks, 2022
2022
-
[29]
Correlated equilibrium and potential games
Abraham Neyman. Correlated equilibrium and potential games. International Journal of Game Theory, 26 0 (2): 0 223--227, June 1997. ISSN 1432-1270. doi:10.1007/bf01295851
-
[30]
On connected sublevel sets in deep learning
Quynh Nguyen. On connected sublevel sets in deep learning. In Kamalika Chaudhuri and Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, volume 97 of Proceedings of Machine Learning Research, pages 4790--4799. PMLR, 09--15 Jun 2019. URL https://proceedings.mlr.press/v97/nguyen19a.html
2019
-
[31]
Revisiting invex functions: Explicit kernel constructions and applications, 2025
Akatsuki Nishioka. Revisiting invex functions: Explicit kernel constructions and applications, 2025
2025
-
[32]
David G. Pearce. Rationalizable strategic behavior and the problem of perfection. Econometrica, 52 0 (4): 0 1029, July 1984. ISSN 0012-9682. doi:10.2307/1911197
-
[33]
R. Pini. Convexity along curves and indunvexity. Optimization, 29 0 (4): 0 301--309, January 1994. ISSN 1029-4945. doi:10.1080/02331939408843959
-
[34]
Invexity and generalized convexity
Rita Pini. Invexity and generalized convexity. Optimization, 22 0 (4): 0 513--525, January 1991. ISSN 1029-4945. doi:10.1080/02331939108843693
-
[35]
Gradient methods for the minimisation of functionals
Boris Polyak. Gradient methods for the minimisation of functionals. Ussr Computational Mathematics and Mathematical Physics, 3: 0 864--878, 12 1963. doi:10.1016/0041-5553(63)90382-3
-
[36]
Extensions of the mountain pass theorem
Patrizia Pucci and James Serrin. Extensions of the mountain pass theorem. Journal of Functional Analysis, 59 0 (2): 0 185--210, November 1984. ISSN 0022-1236. doi:10.1016/0022-1236(84)90072-7
-
[37]
Princeton University Press (1970)
Ralph Tyrell Rockafellar. Convex Analysis. Princeton University Press, December 1970. ISBN 9781400873173. doi:10.1515/9781400873173
-
[38]
J. B. Rosen. Existence and uniqueness of equilibrium points for concave n-person games. Econometrica, 33 0 (3): 0 520, July 1965. ISSN 0012-9682. doi:10.2307/1911749
-
[39]
Robert W. Rosenthal. A class of games possessing pure-strategy nash equilibria. International Journal of Game Theory, 2 0 (1): 0 65--67, December 1973. ISSN 1432-1270. doi:10.1007/bf01737559
-
[40]
Input invex neural network, 2021
Suman Sapkota and Binod Bhattarai. Input invex neural network, 2021
2021
-
[41]
Dale O. Stahl and Paul W. Wilson. On players' models of other players: Theory and experimental evidence. Games and Economic Behavior, 10 0 (1): 0 218--254, July 1995. ISSN 0899-8256. doi:10.1006/game.1995.1031
-
[42]
J. v. Neumann. Zur theorie der gesellschaftsspiele. Mathematische Annalen, 100 0 (1): 0 295--320, dec 1928. doi:10.1007/bf01448847
-
[43]
Global convergence and variance reduction for a class of nonconvex-nonconcave minimax problems
Junchi Yang, Negar Kiyavash, and Niao He. Global convergence and variance reduction for a class of nonconvex-nonconcave minimax problems. Advances in Neural Information Processing Systems, 33: 0 1153--1165, 2020
2020
-
[44]
I. Zang, E. U. Choo, and M. Avriel. On functions whose stationary points are global minima. Journal of Optimization Theory and Applications, 22 0 (2): 0 195--208, June 1977. ISSN 1573-2878. doi:10.1007/bf00933162
-
[45]
Doan, and Justin Romberg
Sihan Zeng, Thinh T. Doan, and Justin Romberg. Connected superlevel set in (deep) reinforcement learning and its application to minimax theorems. In Proceedings of the 37th International Conference on Neural Information Processing Systems, NIPS '23, Red Hook, NY, USA, 2023. Curran Associates Inc
2023
-
[46]
Variational policy gradient method for reinforcement learning with general utilities
Junyu Zhang, Alec Koppel, Amrit Singh Bedi, Csaba Szepesvari, and Mengdi Wang. Variational policy gradient method for reinforcement learning with general utilities. In H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin, editors, Advances in Neural Information Processing Systems, volume 33, pages 4572--4583. Curran Associates, Inc., 2020. URL h...
2020
-
[47]
Łojasiewicz
S. Łojasiewicz. Une propriété topologique des sous-ensembles analytiques réels. In Colloques Internationaux du C.N.R.S. 117: Les Équations aux Dérivées Partielles, pages 87--89, 1963
1963
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.