pith. sign in

arxiv: 2406.19607 · v2 · pith:QC2PLPTQnew · submitted 2024-06-28 · 🧮 math.OC

Closed-loop equilibria for Stackelberg games: a story about stochastic targets

Pith reviewed 2026-05-23 23:51 UTC · model grok-4.3

classification 🧮 math.OC
keywords stochastic Stackelberg gamesclosed-loop strategiesbackward stochastic differential equationstarget constraintsHamilton-Jacobi-Bellman equationsstochastic control
0
0 comments X

The pith

Stackelberg games under closed-loop strategies reduce to single-level stochastic control with target constraints.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a method to turn any continuous-time stochastic Stackelberg differential game, where both leader and follower choose drift and volatility to maximize expected utilities, into a standard optimization problem when players use only the history of the output process. It treats the second-order BSDE that tracks the follower's continuation utility as an extra controlled state variable available to the leader. This converts the leader's problem into one with terminal target constraints that can be solved by adapting existing techniques for stochastic control with targets, yielding a system of Hamilton-Jacobi-Bellman equations whose solution gives the equilibrium strategies and value. The approach is illustrated on a simple example that compares closed-loop solutions to those under other information structures.

Core claim

By considering the second-order backward stochastic differential equation associated with the continuation utility of the follower as a controlled state variable for the leader, the latter's unconventional optimisation problem can be reformulated as a more standard stochastic control problem with target constraints. Thereafter the optimal strategies and equilibrium value are characterised through the solution of a system of Hamilton-Jacobi-Bellman equations.

What carries the argument

The second-order backward stochastic differential equation for the follower's continuation utility, adjoined as an additional controlled state variable for the leader.

If this is right

  • The bi-level Stackelberg problem becomes a single-level stochastic control problem with terminal constraints.
  • Equilibrium strategies and value functions are recovered from solutions to a system of Hamilton-Jacobi-Bellman equations.
  • The method applies to games where both players control drift and volatility of the same output process.
  • Numerical and theoretical comparisons with open-loop or other information structures become feasible via the resulting HJB system.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The reformulation may allow existing numerical solvers for target-constrained control to be applied directly to Stackelberg settings.
  • It suggests a route to closed-loop solutions in other hierarchical stochastic games where the follower's value process can be written as a BSDE.
  • If the HJB system admits unique solutions, the method would guarantee existence of closed-loop equilibria without requiring the leader to observe the noise.

Load-bearing premise

The follower's continuation-utility BSDE can be adjoined as a controlled state for the leader while preserving decisions based only on output history and without introducing explicit dependence on the unobservable driving noise.

What would settle it

An explicit closed-loop equilibrium strategy obtained from the HJB system that cannot be implemented using only the observed output path and requires direct knowledge of the driving noise.

Figures

Figures reproduced from arXiv: 2406.19607 by Camilo Hern\'andez, Dylan Possama\"i, Emma Hubert, Nicol\'as Hern\'andez Santib\'a\~nez.

Figure 1
Figure 1. Figure 1: Comparison of the value functions for various information concepts, with cF = cL = 1, and a◦ = 10. We first remark that for the four sets of parameters, we have the following inequalities for the leader’s value function, V AOL L = V AF L < V ACLM−K¯ L < V CL L < V ACL L = V FB L , and the converse inequalities for the follower’s value. Most of these inequalities were to be expected, as already mentioned in… view at source ↗
Figure 2
Figure 2. Figure 2: Comparison of the value functions for various information concepts, with cF = 1, cL = 1.25, and a◦ = 10. Comparing [PITH_FULL_IMAGE:figures/full_fig_p018_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Comparison of the value functions for various information concepts, with cF = 1.25, cL = 1, and a◦ = 10. 18 [PITH_FULL_IMAGE:figures/full_fig_p018_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Comparison of the value functions for various information concepts, with cF = 1, cL = 1, and a◦ = 15. Finally, comparing [PITH_FULL_IMAGE:figures/full_fig_p019_4.png] view at source ↗
read the original abstract

We provide a general approach to reformulating any continuous-time stochastic Stackelberg differential game under closed-loop strategies as a single-level optimisation problem with target constraints. More precisely, we consider a Stackelberg game in which the leader and the follower can both control the drift and the volatility of a stochastic output process, in order to maximise their respective expected utility. The aim is to characterise the Stackelberg equilibrium when the players adopt 'closed-loop strategies', i.e. their decisions are based solely on the historical information of the output process, excluding especially any direct dependence on the underlying driving noise, often unobservable in real-world applications. We first show that, by considering the second-order backward stochastic differential equation associated with the continuation utility of the follower as a controlled state variable for the leader, the latter's unconventional optimisation problem can be reformulated as a more standard stochastic control problem with target constraints. Thereafter, adapting the methodology developed by Soner and Touzi (2002a) or Bouchard, Elie and Imbert (2010), the optimal strategies, as well as the corresponding value of the Stackelberg equilibrium, can be characterised through the solution of a well-specified system of Hamilton- Jacobi-Bellman equations. For a more comprehensive insight, we illustrate our approach through a simple example, facilitating both theoretical and numerical detailed comparisons with the solutions under different information structures studied in the literature.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper claims to reformulate any continuous-time stochastic Stackelberg differential game under closed-loop strategies (decisions based only on the history of the output process X) as a single-level stochastic control problem with target constraints. This is achieved by adjoining the second-order BSDE for the follower's continuation utility as an additional controlled state for the leader; the resulting problem is then solved via HJB equations adapting Soner-Touzi (2002) and Bouchard et al. (2010), with an illustrative example for comparison across information structures.

Significance. If the adjoining step is rigorously justified, the approach would extend standard stochastic control methods (with target constraints) to closed-loop Stackelberg games where the driving noise is unobservable, providing a general framework beyond open-loop or full-information cases and enabling numerical comparisons in the example.

major comments (2)
  1. [Reformulation of the leader's problem] The central reformulation (abstract and the step adjoining the follower's 2BSDE) treats the 2BSDE solution as a controlled state while claiming to preserve the closed-loop filtration generated by X alone. No verification is supplied that this state process remains adapted to the observable sigma-field of X (without explicit dependence on the unobservable W), which is load-bearing for the single-level problem to be a valid closed-loop Stackelberg equilibrium; the cited external methods do not automatically guarantee this under the paper's information structure.
  2. [HJB characterization step] The subsequent HJB characterization (abstract) is asserted to yield the optimal strategies and equilibrium value, but the manuscript supplies no proof sketch, verification theorem, or error analysis for the reformulated control problem with the adjoined state; this leaves the soundness of the equilibrium characterization unassessed beyond the high-level claim.
minor comments (2)
  1. The abstract refers to 'a well-specified system of Hamilton-Jacobi-Bellman equations' without indicating its dimension or the form of the target constraints in the general case; a brief outline would improve readability.
  2. Notation for the output process, controls, and filtrations could be introduced more explicitly at the start of the main text to aid comparison with the cited literature on different information structures.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their careful reading of the manuscript and for highlighting these important points regarding the rigor of the reformulation. We address the major comments point by point below.

read point-by-point responses
  1. Referee: [Reformulation of the leader's problem] The central reformulation (abstract and the step adjoining the follower's 2BSDE) treats the 2BSDE solution as a controlled state while claiming to preserve the closed-loop filtration generated by X alone. No verification is supplied that this state process remains adapted to the observable sigma-field of X (without explicit dependence on the unobservable W), which is load-bearing for the single-level problem to be a valid closed-loop Stackelberg equilibrium; the cited external methods do not automatically guarantee this under the paper's information structure.

    Authors: We agree that an explicit verification of the adaptation to the filtration generated by X is necessary to confirm that the reformulated problem indeed corresponds to a closed-loop equilibrium. In the manuscript, the 2BSDE is constructed with coefficients that depend on the closed-loop controls, which are functions of the history of X, ensuring by construction that its solution is adapted to the observable filtration. However, we acknowledge that this is not stated explicitly as a separate result. In the revision, we will add a short lemma verifying that the solution to the follower's 2BSDE remains adapted to the sigma-field generated by X, without direct dependence on the unobservable Brownian motion W. This will strengthen the justification for adjoining it as a controlled state. revision: yes

  2. Referee: [HJB characterization step] The subsequent HJB characterization (abstract) is asserted to yield the optimal strategies and equilibrium value, but the manuscript supplies no proof sketch, verification theorem, or error analysis for the reformulated control problem with the adjoined state; this leaves the soundness of the equilibrium characterization unassessed beyond the high-level claim.

    Authors: The HJB system is obtained by direct application of the methodology in Soner and Touzi (2002) and Bouchard et al. (2010) to the reformulated stochastic control problem with target constraints and the additional controlled state from the 2BSDE. While the manuscript relies on these references for the characterization, we concur that including a brief outline of how the verification theorem applies in this augmented setting would be beneficial. We will add a short proof sketch in the revised version, detailing the key steps of the verification argument and noting any modifications required due to the presence of the adjoined state and the target constraints. revision: yes

Circularity Check

0 steps flagged

No circularity; reformulation adapts external Soner-Touzi and Bouchard methodologies without self-reduction

full rationale

The paper's core step adjoins the follower's 2BSDE as a controlled state and then applies the cited external frameworks of Soner-Touzi (2002) and Bouchard-Elie-Imbert (2010) to obtain an HJB system. These citations are independent, externally published, and not self-citations by the present authors. No equation or claim reduces by construction to a fitted parameter or prior result defined inside this manuscript; the closed-loop information structure is preserved by the problem setup rather than by redefinition. The derivation therefore remains non-circular.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no free parameters, axioms, or invented entities are identifiable from the provided text.

pith-pipeline@v0.9.0 · 5800 in / 1136 out tokens · 18058 ms · 2026-05-23T23:51:03.585729+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Principal-agent problems with adverse selection: A stochastic target problem formulation

    econ.TH 2026-05 unverdicted novelty 7.0

    Agent's optimization in unique-contract principal-agent problem with adverse selection is recast as stochastic target problem, enabling principal's objective as stochastic optimal control with partial information and ...

  2. Principal-agent problems with adverse selection: A stochastic target problem formulation

    econ.TH 2026-05 unverdicted novelty 6.0

    Principal-agent adverse selection with unique contracts is reformulated as a stochastic target problem for the agent and a stochastic optimal control problem for the principal.

  3. Optimal control of Volterra integral diffusions and application to contract theory

    math.PR 2025-11 conditional novelty 6.0

    The value function for optimal control of non-convolution Volterra integral diffusions is characterized as the unique viscosity solution to a parabolic PDE on Sobolev space, with applications to time-inconsistent cont...

Reference graph

Works this paper leans on

87 extracted references · 87 canonical work pages · cited by 2 Pith papers

  1. [1]

    R. Aïd, M. Basei, and H. Pham. A McKean–Vlasov approach to distributed electricity generation development.Mathe- matical Methods of Operations Research, 91:269–310, 2020

  2. [2]

    A. Bagchi. Stackelberg differential games in economic models, volume 64 of Lecture notes in control and information sciences. Springer Berlin, Heidelberg, 1984. 32

  3. [3]

    T. Başar. Stochastic stagewise Stackelberg strategies for linear quadratic systems. In M. Kohlmann and W. Vogel, editors, Stochastic control theory and stochastic differential systems. Proceedings of a workshop of the „Sonderforschungsbereich 72 der Deutschen Forschungsgemeinschaft an der Universität Bonn” which took place in January 1979 at Bad Honnef, v...

  4. [4]

    T. Başar. A new method for the Stackelberg solution of differential games with sampled-data state information.IFAC Proceedings Volumes, 14(2):1365–1370, 1981

  5. [5]

    Başar and A

    T. Başar and A. Haurie. Feedback equilibria in differential games with structural and modal uncertainties. In J.B. Cruz Jr., editor, Advances in large scale systems, volume 1, pages 163–201. 1984

  6. [6]

    Başar and G.J

    T. Başar and G.J. Olsder. Team-optimal closed-loop Stackelberg strategies in hierarchical control problems.Automatica, 16(4):409–414, 1980

  7. [7]

    Başar and G.J

    T. Başar and G.J. Olsder.Dynamic noncooperative game theory. SIAM, 2nd revised edition, 1999

  8. [8]

    Başar and H

    T. Başar and H. Selbuz. A new approach for derivation of closed-loop Stackelberg strategies. In R.E. Larson and A.S. Willsky, editors,1978 IEEE conference on decision and control including the 17th symposium on adaptive processes, pages 1113–1118, 1978

  9. [9]

    Başar and H

    T. Başar and H. Selbuz. Closed-loop Stackelberg strategies with applications in the optimal control of multilevel systems. IEEE Transactions on Automatic Control, 24(2):166–179, 1979

  10. [10]

    Bensoussan, S

    A. Bensoussan, S. Chen, and S.P. Sethi. Feedback Stackelberg solutions of infinite-horizon stochastic differential ames. In F. El Ouardighi and K. Kogan, editors,Models and methods in economics and management science: essays in honor of Charles S. Tapiero, volume 198 ofInternational series in operations research & management science, pages 3–15. Springer ...

  11. [11]

    Bensoussan, S

    A. Bensoussan, S. Chen, and S.P. Sethi. The maximum principle for global solutions of stochastic Stackelberg differential games. SIAM Journal on Control and Optimization, 53(4):1956–1981, 2015

  12. [12]

    Bensoussan, S

    A. Bensoussan, S. Chen, A. Chutani, S.P. Sethi, C.C. Siu, and S.C.P. Yam. Feedback Stackelberg–Nash equilibria in mixed leadership games with an application to cooperative advertising.SIAM Journal on Control and Optimization, 57 (5):3413–3444, 2019

  13. [13]

    Bouchard, R

    B. Bouchard, R. Élie, and N. Touzi. Stochastic target problems with controlled loss. SIAM Journal on Control and Optimization, 48(5):3123–3150, 2009

  14. [14]

    Bouchard, R

    B. Bouchard, R. Élie, and C. Imbert. Optimal control under stochastic target constraints.SIAM Journal on Control and Optimization, 48(5):3501–3531, 2010

  15. [15]

    A. Bressan. Noncooperative differential games.Milan Journal of Mathematics, 79:357–427, 2011

  16. [16]

    Carmona.Lectures on BSDEs, stochastic control, and stochastic differential games with financial applications, volume 1 of Financial mathematics

    R. Carmona.Lectures on BSDEs, stochastic control, and stochastic differential games with financial applications, volume 1 of Financial mathematics. SIAM, 2016

  17. [17]

    Castanon

    D.A. Castanon. Equilibria in stochastic dynamic games of Stackelberg type. PhD thesis, Massachusetts Institute of Tech- nology, 1976

  18. [18]

    Stackelbergsolutionfortwo-persongameswithbiasedinformationpatterns

    C.I.ChenandJ.B.CruzJr. Stackelbergsolutionfortwo-persongameswithbiasedinformationpatterns. IEEE Transactions on Automatic Control, 17(6):791–798, 1972

  19. [19]

    Chutani and S.P

    A. Chutani and S.P. Sethi. A feedback Stackelberg game of cooperative advertising in a durable goods oligopoly. In J. Haunschmied, V.M. Veliov, and S. Wrzaczek, editors,Dynamic games in economics, volume 16 ofDynamic modeling and econometrics in economics and finance, pages 89–114. Springer, 2014

  20. [20]

    Cong and J

    W. Cong and J. Shi. Direct approach of linear–quadratic Stackelberg mean field games of backward–forward stochastic systems. ArXiv preprint arXiv:2401.15835, 2024

  21. [21]

    Crandall, H

    M.G. Crandall, H. Ishii, and P.-L. Lions. User’s guide to viscosity solutions of second order partial differential equations. Bulletin of the American Mathematical Society, 27(1):1–67, 1992

  22. [22]

    J.B. Cruz Jr. Survey of Nash and Stackelberg equilibrium strategies in dynamic games. InAnnals of economic and social measurement, volume 4, pages 339–344. National Bureau of Economic Research, 1975

  23. [23]

    J.B. Cruz Jr. Stackelberg strategies for multilevel systems. In Y.C. Ho and S.K. Mitter, editors,Directions in large-scale systems, pages 139–147. Springer New York, NY, 1976. 33

  24. [24]

    Cvitanić and J

    J. Cvitanić and J. Zhang.Contract theory in continuous-time models. Springer, 2012

  25. [25]

    Cvitanić, D

    J. Cvitanić, D. Possamaï, and N. Touzi. Dynamic programming approach to principal–agent problems.Finance and Stochastics, 22(1):1–37, 2018

  26. [26]

    Dayanıklı and M

    G. Dayanıklı and M. Laurière. A machine learning method for Stackelberg mean field games. ArXiv preprint arXiv:2302.10440, 2023

  27. [27]

    Dockner, S

    E.J. Dockner, S. Jorgensen, N. Van Long, and G. Sorger. Differential games in economics and management science. Cambridge University Press, 2000

  28. [28]

    El Karoui and X

    N. El Karoui and X. Tan. Capacities, measurable selection and dynamic programming part II: application in stochastic control problems. Technical report, École Polytechnique and université Paris-Dauphine, 2013

  29. [29]

    X. Feng, Y. Hu, and J. Huang. Backward Stackelberg differential game with constraints: a mixed terminal-perturbation and linear–quadratic approach.SIAM Journal on Control and Optimization, 60(3):1488–1518, 2022

  30. [30]

    Fu and U

    G. Fu and U. Horst. Mean-field leader–follower games with terminal state constraint.SIAM Journal on Control and Optimization, 58(4):2078–2113, 2020

  31. [31]

    B Gardner and J.B. Cruz Jr. Feedback Stackelberg strategy for a two player game.IEEE Transactions on Automatic Control, 22(2):270–271, 1977

  32. [32]

    Gou, N.-J

    Z. Gou, N.-J. Huang, and M.-H. Wang. A linear–quadratic mean-field stochastic Stackelberg differential game with random exit time. International Journal of Control, 96(3):731–745, 2023

  33. [33]

    G. Guan, Z. Liang, and Y. Song. A Stackelberg reinsurance–investment game underα-maxmin mean–variance criterion and stochastic volatility.Scandinavian Actuarial Journal, 2024(1):28–63, 2024

  34. [34]

    X. Han, D. Landriault, and D. Li. Optimal reinsurance contract in a Stackelberg game framework: a view of social planner. Scandinavian Actuarial Journal, 2024(2):124–148, 2024

  35. [35]

    Havrylenko, M

    Y. Havrylenko, M. Hinken, and R. Zagst. Risk sharing in equity-linked insurance products: Stackelberg equilibrium between an insurer and a reinsurer.ArXiv preprint arXiv:2203.04053, 2022

  36. [36]

    X. He, A. Prasad, S.P. Sethi, and G.J. Gutierrez. A survey of Stackelberg differential game models in supply and marketing channels. Journal of Systems Science and Systems Engineering, 16:385–413, 2007

  37. [37]

    X. He, A. Prasad, and S.P. Sethi. Cooperative advertising and pricing in a dynamic stochastic supply chain: feedback Stackelberg strategies. In D.F. Kocaoglu, T.R. Anderson, and T.U. Daim, editors,PICMET ’08, Portland international conference on management of engineering & technology, pages 1634–1649, 2008

  38. [38]

    Huang and J

    Q. Huang and J. Shi. A verification theorem for Stackelberg stochastic differential games in feedback information pattern. ArXiv preprint arXiv:2108.06498, 2021

  39. [39]

    Kang and J

    K. Kang and J. Shi. A three-level stochastic linear–quadratic Stackelberg differential game with asymmetric information. ArXiv preprint arXiv:2210.11808, 2022

  40. [40]

    Karandikar

    R.L. Karandikar. On pathwise stochastic integration.Stochastic Processes and their Applications, 57(1):11–18, 1995

  41. [41]

    Leitmann

    G. Leitmann. On generalized Stackelberg strategies. Journal of Optimization Theory and Applications, 26(4):637–643, 1978

  42. [42]

    H. Li, J. Xu, and H. Zhang. Closed–loop Stackelberg strategy for linear–quadratic leader–follower game.ArXiv preprint arXiv:2212.08977, 2022

  43. [43]

    Li and Z

    N. Li and Z. Yu. Forward–backward stochastic differential equations and linear–quadratic generalized Stackelberg games. SIAM Journal on Control and Optimization, 56(6):4148–4180, 2018

  44. [44]

    Li and S.P

    T. Li and S.P. Sethi. A review of dynamic Stackelberg game models.Discrete & Continuous Dynamical Systems–B, 22(1): 125–129, 2017

  45. [45]

    Li and S

    Y. Li and S. Han. Solving strongly convex and smooth Stackelberg games without modeling the follower.ArXiv preprint arXiv:2303.06192, 2023

  46. [46]

    Li and J

    Z. Li and J. Shi. Closed-loop solvability of linear quadratic mean-field type Stackelberg stochastic differential games.ArXiv preprint arXiv:2303.07544, 2023. 34

  47. [47]

    Li and J

    Z. Li and J. Shi. Linear quadratic leader–follower stochastic differential games: closed-loop solvability.Journal of Systems Science and Complexity, 36(4):1373–1406, 2023

  48. [48]

    J. Liu, Y. Fan, Z. Chen, and Y. Zheng. Pessimistic bilevel optimization: a survey.International Journal of Computational Intelligence Systems, 11(1):725–736, 2018

  49. [49]

    S. Lv, J. Xiong, and X. Zhang. Linear quadratic leader–follower stochastic differential games for mean-field switching diffusions. Automatica, 154(111072):1–9, 2023

  50. [50]

    Mallozzi and J

    L. Mallozzi and J. Morgan. Weak Stackelberg problem and mixed solutions under data perturbations.Optimization, 32 (3):269–290, 1995

  51. [51]

    J. Moon. Linear–quadratic stochastic Stackelberg differential games for jump–diffusion systems.SIAM Journal on Control and Optimization, 59(2):954–976, 2021

  52. [52]

    Y.-H. Ni, L. Liu, and X. Zhang. Deterministic dynamic Stackelberg games: time-consistent open-loop solution.Automatica, 148(110757):1–9, 2023

  53. [53]

    M. Nutz. Pathwise construction of stochastic integrals.Electronic Communications in Probability, 17(24):1–7, 2012

  54. [54]

    Øksendal, L

    B. Øksendal, L. Sandal, and J. Ubøe. Stochastic Stackelberg equilibria with applications to time-dependent newsvendor models. Journal of Economic Dynamics and Control, 37(7):1284–1299, 2013

  55. [55]

    Papavassilopoulos

    G.P. Papavassilopoulos. Leader–follower and Nash strategies with state information. PhD thesis, University of Illinois at Urbana-Champaign, 1979

  56. [56]

    Papavassilopoulos and J.B

    G.P. Papavassilopoulos and J.B. Cruz Jr. Nonclassical control problems and Stackelberg games.IEEE Transactions on Automatic Control, 24(2):155–166, 1979

  57. [57]

    Papavassilopoulos and J.B

    G.P. Papavassilopoulos and J.B. Cruz Jr. Sufficient conditions for Stackelberg and Nash strategies with memory.Journal of Optimization Theory and Applications, 31(2):233–260, 1980

  58. [58]

    Possamaï, X

    D. Possamaï, X. Tan, and C. Zhou. Stochastic control for a class of nonlinear kernels and applications.The Annals of Probability, 46(1):551–603, 2018

  59. [59]

    Possamaï, N

    D. Possamaï, N. Touzi, and J. Zhang. Zero-sum path-dependent stochastic differential games in weak formulation.The Annals of Applied Probability, 30(3):1415–1457, 2020

  60. [60]

    Z. Ren, X. Tan, N. Touzi, and J. Yang. Entropic optimal planning for path-dependent mean field games.SIAM Journal on Control and Optimization, 61(3):1415–1437, 2023

  61. [61]

    Rockafellar

    R.T. Rockafellar. Convex analysis. Princeton University Press, 1970

  62. [62]

    J. Shi, G. Wang, and J. Xiong. Leader–follower stochastic differential game with asymmetric information and applications. Automatica, 63:60–73, 2016

  63. [63]

    Si and Z

    K. Si and Z. Wu. Backward–forward linear–quadratic mean-field Stackelberg games.Advances in Difference Equations, 2021(73):1–23, 2021

  64. [64]

    Simaan and J.B

    M. Simaan and J.B. Cruz Jr. Additional aspects of the Stackelberg strategy in nonzero-sum games.Journal of Optimization Theory and Applications, 11(6):613–626, 1973

  65. [65]

    Simaan and J.B

    M. Simaan and J.B. Cruz Jr. On the Stackelberg strategy in nonzero-sum games.Journal of Optimization Theory and Applications, 11(5):533–555, 1973

  66. [66]

    Simaan and J.B

    M. Simaan and J.B. Cruz Jr. On the Stackelberg strategy in nonzero-sum games. In G. Leitmann, editor,Multicriteria decision making and differential games, Mathematical concepts and methods in science and engineering, pages 173–195. Springer New York, NY, 1976

  67. [67]

    Soner and N

    H.M. Soner and N. Touzi. Dynamic programming for stochastic target problems and geometric flows.Journal of the European Mathematical Society, 4(3):201–236, 2002

  68. [68]

    Soner and N

    H.M. Soner and N. Touzi. Stochastic target problems, dynamic programming, and viscosity solutions.SIAM Journal on Control and Optimization, 41(2):404–424, 2002

  69. [69]

    Soner and N

    H.M. Soner and N. Touzi. A stochastic representation for mean curvature type geometric flows.The Annals of Probability, 31(3):1145–1165, 2003. 35

  70. [70]

    Soner, N

    H.M. Soner, N. Touzi, and J. Zhang. Wellposedness of second order backward SDEs.Probability Theory and Related Fields, 153(1–2):149–190, 2012

  71. [71]

    Stroock and S.R.S

    D.W. Stroock and S.R.S. Varadhan.Multidimensional diffusion processes, volume 233 ofGrundlehren der mathematischen Wissenschaften. Springer-Verlag Berlin Heidelberg, 1997

  72. [72]

    J. Sun, H. Wang, and J. Wen. Zero-sum Stackelberg stochastic linear–quadratic differential games.SIAM Journal on Control and Optimization, 61(1):252–284, 2023

  73. [73]

    Van Long

    N. Van Long. A survey of dynamic games in economics, volume 1 of Surveys on theories in economics and business administration. World Scientific, 2010

  74. [74]

    D. Vasal. Master equation of discrete-time Stackelberg mean field games with multiple leaders. ArXiv preprint arXiv:2209.03186, 2022

  75. [75]

    D. Vasal. Sequential decomposition of stochastic Stackelberg games. In B. Ferri and F. Zhang, editors,2022 American control conference, pages 1266–1271. IEEE, 2022

  76. [76]

    von Stackelberg.Marktform und Gleichgewicht

    H. von Stackelberg.Marktform und Gleichgewicht. Springer-Verlag Wien New York, 1934

  77. [77]

    G. Wang, Y. Wang, and S. Zhang. An asymmetric information mean-field type linear–quadratic stochastic Stackelberg differential game with one leader and two followers.Optimal Control Applications and Methods, 41(4):1034–1051, 2020

  78. [78]

    Wiesemann, A

    W. Wiesemann, A. Tsoukalas, P.-M. Kleniati, and B. Rustem. Pessimistic bilevel optimization.SIAM Journal on Opti- mization, 23(1):353–380, 2013

  79. [79]

    Z. Wu. A general maximum principle for optimal control of forward–backward stochastic systems.Automatica, 49(5): 1473–1480, 2013

  80. [80]

    J. Yong. A leader–follower stochastic linear quadratic differential game.SIAM Journal on Control and Optimization, 41 (4):1015–1041, 2002

Showing first 80 references.