Distributed games with jumps: An $\alpha$-potential game approach

arxiv: 2508.01929 · v2 · submitted 2025-08-03 · 🧮 math.OC · cs.MA· math.PR

Distributed games with jumps: An α-potential game approach

Xin Guo , Xinyu Li , Yufei Zhang This is my paper

Pith reviewed 2026-05-19 00:45 UTC · model grok-4.3

classification 🧮 math.OC cs.MAmath.PR

keywords α-potential gamesdistributed gamesjump diffusionsNash equilibriacrowd motion networksportfolio selectionstochastic control

0 comments p. Extension

The pith

Analyzing α-Nash equilibria in distributed games with jump diffusions reduces to solving a finite-dimensional control problem.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper places distributed games with jump diffusions into the α-potential game framework to simplify their equilibrium analysis. It shows that the search for α-Nash equilibria becomes equivalent to solving a finite-dimensional control problem instead of handling the full stochastic dynamics directly. Explicit results follow for crowd-motion networks, where α vanishes on symmetric graphs and decays in controlled ways on asymmetric ones, and for an N-player mean-variance portfolio game that is shown to be a potential game with an explicit equilibrium allowing heterogeneous preferences. The reduction also supports numerical solution via policy-gradient methods.

Core claim

For distributed games with jump diffusions that belong to the α-potential class, the analysis of α-Nash equilibria reduces to solving a finite-dimensional control problem. Viscosity and verification characterizations are obtained for the general case. In crowd-motion network games α equals zero for every symmetric interaction network and decays polynomially or logarithmically with the number of players, network degree, and interaction asymmetry in asymmetric networks. The N-player portfolio-selection game under mean-variance criteria is a potential game whose Nash equilibrium is constructed explicitly even when players have heterogeneous preference parameters.

What carries the argument

The α-potential function, which converts the search for α-Nash equilibria into an equivalent finite-dimensional optimal-control problem.

Load-bearing premise

The distributed games with jump diffusions are assumed to belong to the α-potential game class.

What would settle it

A concrete distributed game with jumps whose α-Nash equilibria cannot be recovered by solving the corresponding finite-dimensional control problem would falsify the reduction.

Figures

Figures reproduced from arXiv: 2508.01929 by Xin Guo, Xinyu Li, Yufei Zhang.

**Figure 1.** Figure 1: Equilibrium trajectories in the aversion game with a Gaussian kernel and uniform interaction weights. Left: mean positions over 500 simulations. Right: one representative trajectory. The solid circle denotes the shared initial position; the cross marks the common target. Markers “1”, “2”, and “3” indicate positions at times 0.25, 0.5, and 0.75, respectively. 6.2.2. Flocking Games with Idiosyncratic Noises.… view at source ↗

**Figure 2.** Figure 2: Equilibrium trajectories in the flocking game with a quadratic kernel and uniform interaction weights. Left: mean positions over 500 simulations. Right: one representative trajectory. The solid circle denotes the shared initial position; the crosses mark the individual targets. Markers “1”, “2”, and “3” indicate positions at times 0.25, 0.5, and 0.75, respectively [PITH_FULL_IMAGE:figures/full_fig_p019_2.png] view at source ↗

**Figure 3.** Figure 3: Equilibrium trajectories in the flocking game with a quadratic kernel and group-based interaction weights. Players 1 and 4 belong to one group, and players 2 and 3 form the other. The interaction weights qij = 1 if players i and j are in the same group, and qij = 0 otherwise. Left: mean positions over 500 simulations. Right: one representative trajectory. The solid circle denotes the shared initial positio… view at source ↗

**Figure 4.** Figure 4: Equilibrium trajectories in the flocking game with a quadratic kernel, uniform interaction weights, and pure common jumps. The solid circle denotes the shared initial position; the crosses mark the individual targets. Markers “1”, “2”, and “3” indicate positions at times 0.25, 0.5, and 0.75, respectively. the intermediate paths, the overall flocking behavior remains consistent with the patterns observed in… view at source ↗

read the original abstract

Motivated by game-theoretic models of crowd motion dynamics, this paper analyzes a broad class of distributed games with jump diffusions within the recently developed $\alpha$-potential game framework. We demonstrate that analyzing the $\alpha$-Nash equilibria reduces to solving a finite-dimensional control problem. Beyond the viscosity and verification characterizations for the general games, we examine explicitly and in detail how spatial population distributions and interaction rules influence the structure of $\alpha$-Nash equilibria in these distributed settings. For crowd motion network games, we show that $\alpha = 0$ for all symmetric interaction networks, and or asymmetric networks. We quantify the precise polynomial and logarithmic decays of $\alpha$ in terms of the number of players, the degree of the network, and the decay rate of interaction asymmetry. We also exploit the $\alpha$-potential game framework to analyze an $N$-player portfolio selection game under a mean-variance criterion. We show that this portfolio game constitutes a potential game and explicitly construct its Nash equilibrium. Our analysis allows for heterogeneous preference parameters, going beyond the mean-field interactions considered in the existing game literature. Our theoretical results are supported by numerical implementations using policy gradient-based algorithms, demonstrating the computational advantages of the $\alpha$-potential game framework in computing Nash equilibria for general dynamic games.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 3 minor

Summary. The paper analyzes distributed games with jump diffusions in the recently developed α-potential game framework. It claims that α-Nash equilibria reduce to finite-dimensional control problems, supplies viscosity and verification characterizations, derives explicit polynomial and logarithmic decays of α for symmetric and asymmetric crowd-motion network games (in terms of player number, network degree, and asymmetry decay), and explicitly constructs the Nash equilibrium for an N-player mean-variance portfolio game allowing heterogeneous preferences. Numerical policy-gradient implementations are provided to illustrate computational advantages.

Significance. If the central reduction holds, the work supplies a concrete route from nonlocal jump-diffusion games to tractable control problems, with explicit α-decay rates and equilibrium constructions that go beyond mean-field limits. The explicit treatment of network asymmetry and heterogeneous portfolio preferences, together with reproducible numerical support, strengthens the contribution to dynamic game theory and its applications in crowd dynamics and finance.

major comments (3)

[§3] §3 (general framework): The reduction of α-Nash analysis to a finite-dimensional control problem presupposes that the jump-diffusion games satisfy the α-potential property. The nonlocal jump integral in the generator can violate the required variational inequality unless the interaction kernel obeys symmetry or quantified asymmetry decay; no explicit moment or intensity bounds on the jump measure are supplied to guarantee uniformity of α when jumps are large or frequent. This is load-bearing for the abstract claim that a broad class reduces to control.
[§4.2] §4.2 (asymmetric networks): The polynomial and logarithmic decay rates for α are stated in terms of network degree and asymmetry decay, yet the derivation appears to invoke an additional decay assumption on the kernel that is not listed among the standing hypotheses; without it the bound may fail to be uniform in N.
[§5] §5 (portfolio game): The claim that the heterogeneous-preference mean-variance game is a potential game (hence α = 0) is asserted after the jump terms are introduced, but the verification that the jump compensator preserves the exact potential property (rather than only an α-approximation) is not shown explicitly; this step is required to justify the closed-form equilibrium construction.

minor comments (3)

[Abstract] The abstract sentence 'and or asymmetric networks' is grammatically incomplete and should be repaired.
[§2] Notation for the jump measure and its compensator should be introduced once in §2 and used consistently thereafter.
[Numerical section] Figure captions for the numerical examples would benefit from explicit statements of the jump intensity and network parameters used.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments on our manuscript. We address each major comment point by point below, providing clarifications and indicating where revisions will be made to strengthen the presentation.

read point-by-point responses

Referee: [§3] §3 (general framework): The reduction of α-Nash analysis to a finite-dimensional control problem presupposes that the jump-diffusion games satisfy the α-potential property. The nonlocal jump integral in the generator can violate the required variational inequality unless the interaction kernel obeys symmetry or quantified asymmetry decay; no explicit moment or intensity bounds on the jump measure are supplied to guarantee uniformity of α when jumps are large or frequent. This is load-bearing for the abstract claim that a broad class reduces to control.

Authors: We agree that uniformity of α requires control on the jump measure. The manuscript establishes the α-potential property under the stated symmetry or quantified asymmetry conditions on the interaction kernel, which ensure the variational inequality holds for the generator. To guarantee uniformity when jumps are large or frequent, we will add explicit moment and intensity bounds on the Lévy measure in the revised Section 3. These assumptions will be listed among the standing hypotheses and will confirm that the reduction to finite-dimensional control remains valid for the broad class of games considered. revision: yes
Referee: [§4.2] §4.2 (asymmetric networks): The polynomial and logarithmic decay rates for α are stated in terms of network degree and asymmetry decay, yet the derivation appears to invoke an additional decay assumption on the kernel that is not listed among the standing hypotheses; without it the bound may fail to be uniform in N.

Authors: The decay assumption on the kernel is used in the derivation and is implicit in the setup for asymmetric networks, but we acknowledge it was not explicitly enumerated in the list of standing hypotheses. In the revision we will add this condition explicitly to the hypotheses in Section 4.2, reference it directly in the proof of the decay rates, and verify that the polynomial and logarithmic bounds remain uniform in N under the stated assumptions. revision: yes
Referee: [§5] §5 (portfolio game): The claim that the heterogeneous-preference mean-variance game is a potential game (hence α = 0) is asserted after the jump terms are introduced, but the verification that the jump compensator preserves the exact potential property (rather than only an α-approximation) is not shown explicitly; this step is required to justify the closed-form equilibrium construction.

Authors: We thank the referee for highlighting this omission. The continuous-part potential property is verified, but the explicit check for the jump compensator was not written out. In the revised Section 5 we will insert a direct verification that the compensator term preserves the exact potential property (hence α = 0 for the full jump-diffusion game), thereby justifying the closed-form Nash equilibrium construction for heterogeneous preferences. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation applies external framework with independent verifications

full rationale

The paper invokes the recently developed α-potential game framework to reduce α-Nash analysis to a finite-dimensional control problem, then supplies new viscosity/verification characterizations plus explicit constructions for jump-diffusion network games (showing α=0 on symmetric interactions and polynomial/logarithmic decay rates on asymmetric ones) and for heterogeneous portfolio selection. These steps rely on direct verification of the α-potential property under the paper's own jump-integral generator and interaction kernels rather than reducing by definition, fitted parameters, or unverified self-citation chains. The central reduction holds conditionally on membership in the α-potential class, which the paper establishes independently for the examined cases without smuggling ansatzes or renaming known results.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests primarily on the domain assumption that the class of distributed games with jump diffusions fits the α-potential game framework, enabling the reduction and explicit results; no free parameters or invented entities are evident from the abstract.

axioms (1)

domain assumption Distributed games with jump diffusions belong to the α-potential game class
This assumption is invoked to reduce α-Nash equilibria to finite-dimensional control problems and to derive the network and portfolio results.

pith-pipeline@v0.9.0 · 5761 in / 1370 out tokens · 64387 ms · 2026-05-19T00:45:21.244069+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

We say G is an α-potential game for α≥0 if there exists Φ such that |Ji(u′i,u−i)−Ji(ui,u−i)−(Φ(u′i,u−i)−Φ(ui,u−i))|≤α (Def 2.4); for symmetric networks α=0 (Cor 6.1(a))

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

NePPO: Near-Potential Policy Optimization for General-Sum Multi-Agent Reinforcement Learning
cs.LG 2026-03 unverdicted novelty 7.0

NePPO learns a player-independent potential function via a novel objective whose minimization yields an approximate Nash equilibrium for general-sum multi-agent games.

Reference graph

Works this paper leans on

46 extracted references · 46 canonical work pages · cited by 1 Pith paper

[1]

Aghajani and A

A. Aghajani and A. Doustmohammadi. Formation control of multi-vehicle systems using co- operative game theory. In 2015 15th International Conference on Control, Automation and Systems (ICCAS), pages 704–709. IEEE, 2015

work page 2015
[2]

Aurell, R

A. Aurell, R. Carmona, and M. Lauriere. Stochastic graphon games: II. the linear-quadratic case. Applied Mathematics & Optimization , 85(3):39, 2022

work page 2022
[3]

Aurell and B

A. Aurell and B. Djehiche. Mean-field type modeling of nonlocal crowd aversion in pedestrian crowd dynamics. SIAM Journal on Control and Optimization , 56(1):434–455, 2018

work page 2018
[4]

Barles, R

G. Barles, R. Buckdahn, and E. Pardoux. Backward stochastic differential equations and integral-partial differential equations. Stochastics: An International Journal of Probability and Stochastic Processes, 60(1-2):57–83, 1997

work page 1997
[5]

A. Blum, E. Even-Dar, and K. Ligett. Routing without regret: On convergence to nash equilibria of regret-minimizing algorithms in routing games. Theory of Computing , 6(1):179– 199, 2010

work page 2010
[6]

R. Carmona. Lectures on BSDEs, stochastic control, and stochastic differential games with financial applications. SIAM, 2016

work page 2016
[7]

Carmona, Q

R. Carmona, Q. Cormier, and H. M. Soner. Synchronization in a kuramoto mean field game. Communications in Partial Differential Equations , 48(9):1214–1244, 2023

work page 2023
[8]

Carmona and F

R. Carmona and F. Delarue. Probabilistic Theory of Mean Field Games with Applications I: Mean Field FBSDEs, Control, and Games , volume 83. Springer, 2018

work page 2018
[9]

Colombo and D

A. Colombo and D. Del Vecchio. Efficient algorithms for collision avoidance at intersections. In Proceedings of the 15th ACM international conference on Hybrid Systems: Computation and Control, pages 145–154, 2012

work page 2012
[10]

Cont and E

R. Cont and E. Voltchkova. A finite difference scheme for option pricing in jump diffusion and exponential L´ evy models.SIAM Journal on Numerical Analysis , 43(4):1596–1626, 2005. α-POTENTIAL DISTRIBUTED GAMES WITH JUMPS 21

work page 2005
[11]

Dumitrescu, M.-C

R. Dumitrescu, M.-C. Quenez, and A. Sulem. A weak dynamic programming principle for combined optimal stopping/stochastic control with E f-expectations. SIAM Journal on Control and Optimization, 54(4):2090–2115, 2016

work page 2090
[12]

Dumitrescu, M.-C

R. Dumitrescu, M.-C. Quenez, and A. Sulem. Mixed generalized Dynkin game and stochastic control in a Markovian framework. Stochastics, 89(1):400–429, 2017

work page 2017
[13]

Dumitrescu, C

R. Dumitrescu, C. Reisinger, and Y. Zhang. Approximation schemes for mixed optimal stop- ping and control problems with nonlinear expectations and jumps. Applied Mathematics & Optimization, 83(3):1387–1429, 2021

work page 2021
[14]

Giegrich, C

M. Giegrich, C. Reisinger, and Y. Zhang. Convergence of policy gradient methods for finite- horizon exploratory linear-quadratic control problems. SIAM Journal on Control and Opti- mization, 62(2):1060–1092, 2024

work page 2024
[15]

X. Guo, X. Li, C. Maheshwari, S. Sastry, and M. Wu. Markov α-potential games: Equilibrium approximation and regret analysis. arXiv preprint arXiv:2305.12553 , 2023

work page arXiv 2023
[16]

X. Guo, X. Li, and L. Zhang. Bsde approach for α-potential stochastic differential games. arXiv preprint arXiv:2507.13256 , 2025

work page arXiv 2025
[17]

X. Guo, X. Li, and Y. Zhang. An α-potential game framework for N-player games. arXiv:2403.16962, 2024

work page arXiv 2024
[18]

Guo and Y

X. Guo and Y. Zhang. Towards an analytical framework for dynamic potential games. SIAM Journal on Control and Optimization , 63(2):1213–1242, 2025

work page 2025
[19]

R. Hu. Deep fictitious play for stochastic differential games. Communications in Mathematical Sciences, 19(2):325–353, 2021

work page 2021
[20]

Jackson and D

J. Jackson and D. Lacker. Approximately optimal distributed stochastic controls beyond the mean field setting. arXiv preprint arXiv:2301.02901 , 2023

work page arXiv 2023
[21]

E. R. Jakobsen and K. H. Karlsen. Continuous dependence estimates for viscosity solutions of integro-pdes. Journal of Differential Equations , 212(2):278–318, 2005

work page 2005
[22]

Kalaria, C

D. Kalaria, C. Maheshwari, and S. Sastry. α-racer: Real-time algorithm for game-theoretic motion planning and control in autonomous racing using near-potential function.arXiv preprint arXiv:2412.08855, 2024

work page arXiv 2024
[23]

Kavuncu, A

T. Kavuncu, A. Yaraneri, and N. Mehr. Potential iLQR: A potential-minimizing controller for planning multi-agent interactive trajectories. arXiv preprint arXiv:2107.04926 , 2021

work page arXiv 2021
[24]

D. P. Kingma and J. Ba. Adam: A method for stochastic optimization, 2017

work page 2017
[25]

Krichene, W

S. Krichene, W. Krichene, R. Dong, and A. Bayen. Convergence of heterogeneous distributed learning in stochastic routing games. In 2015 53rd Annual Allerton Conference on Communi- cation, Control, and Computing (Allerton) , pages 480–487. IEEE, 2015

work page 2015
[26]

H. Kunita. Stochastic Differential Equations Based on L´ evy Processes and Stochastic Flows of Diffeomorphisms, pages 305–373. Birkh¨ auser Boston, Boston, MA, 2004

work page 2004
[27]

Lachapelle and M.-T

A. Lachapelle and M.-T. Wolfram. On a mean field game approach modeling congestion and aversion in pedestrian crowds. Transportation research part B: methodological , 45(10):1572– 1589, 2011

work page 2011
[28]

L. Lu, R. Hu, X. Yang, and Y. Zhu. Multiagent relative investment games in a jump diffusion market with deep reinforcement learning algorithm. SIAM Journal on Financial Mathematics , 16(2):707–746, 2025

work page 2025
[29]

Z. Ma, D. S. Callaway, and I. A. Hiskens. Decentralized charging control of large populations of plug-in electric vehicles. IEEE Transactions on control systems technology , 21(1):67–78, 2011

work page 2011
[30]

Maheshwari, M

C. Maheshwari, M. Wu, and S. Sastry. Convergence of decentralized actor-critic algorithm in general-sum markov games. IEEE Control Systems Letters , 2024

work page 2024
[31]

Mazumdar, L

E. Mazumdar, L. J. Ratliff, M. I. Jordan, and S. S. Sastry. Policy-gradient algorithms have no guarantees of convergence in linear quadratic games. In Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems , pages 860–868, 2020. 22 α-POTENTIAL DISTRIBUTED GAMES WITH JUMPS

work page 2020
[32]

Narasimha, K

D. Narasimha, K. Lee, D. Kalathil, and S. Shakkottai. Multi-agent learning via markov poten- tial games in marketplaces for distributed energy resources. In 2022 IEEE 61st Conference on Decision and Control (CDC) , pages 6350–6357. IEEE, 2022

work page 2022
[33]

Nourian, P

M. Nourian, P. E. Caines, and R. P. Malham´ e. Mean field analysis of controlled Cucker- Smale type flocking: Linear analysis and perturbation equations. IFAC Proceedings Volumes, 44(1):4471–4476, 2011

work page 2011
[34]

Paccagnan, M

D. Paccagnan, M. Kamgarpour, and J. Lygeros. On aggregative and mean field games with applications to electricity markets. In 2016 European Control Conference (ECC) , pages 196–

work page 2016
[35]

H. Pham. Optimal stopping of controlled jump diffusion processes: a viscosity solution ap- proach. J. Math. Syst. Estimat. Control , 8(1):1, 1998

work page 1998
[36]

Ramchurn, P

S. Ramchurn, P. Vytelingum, A. Rogers, and N. Jennings. Agent-based control for decentralised demand side management in the smart grid. 2011

work page 2011
[37]

Reisinger, W

C. Reisinger, W. Stockinger, and Y. Zhang. Linear convergence of a policy gradient method for some finite horizon continuous time control problems. SIAM Journal on Control and Op- timization, 61(6):3526–3558, 2023

work page 2023
[38]

Reisinger and Y

C. Reisinger and Y. Zhang. A penalty scheme and policy iteration for nonlocal HJB varia- tional inequalities with monotone nonlinearities. Computers & Mathematics with Applications , 93:199–213, 2021

work page 2021
[39]

Santambrogio and W

F. Santambrogio and W. Shim. A cucker–smale inspired deterministic mean field game with velocity interactions. SIAM Journal on Control and Optimization , 59(6):4155–4187, 2021

work page 2021
[40]

Sethi, D

D. Sethi, D. ˇSiˇ ska, and Y. Zhang. Entropy annealing for policy mirror descent in continuous time and space. arXiv preprint arXiv:2405.20250 , 2024

work page arXiv 2024
[41]

Srikantha and D

P. Srikantha and D. Kundur. Resilient distributed real-time demand response via population games. IEEE Transactions on Smart Grid , 8(6):2532–2543, 2016

work page 2016
[42]

Sun, P.-Y

L. Sun, P.-Y. Hung, C. Wang, M. Tomizuka, and Z. Xu. Distributed multi-agent interaction generation with imagined potential games. arXiv preprint arXiv:2310.01614 , 2023

work page arXiv 2023
[43]

L. Sun, Y. Wang, P.-Y. Hung, C. Wang, X. Zhang, Z. Xu, and M. Tomizuka. Imagined potential games: A framework for simulating, learning and evaluating interactive behaviors. arXiv preprint arXiv:2411.03669 , 2024

work page arXiv 2024
[44]

Tordeux, M

A. Tordeux, M. Chraibi, and A. Seyfried. Collision-free speed model for pedestrian dynamics. In Traffic and Granular Flow’15, pages 225–232. Springer, 2016

work page 2016
[45]

Tushar, T

W. Tushar, T. K. Saha, C. Yuen, D. Smith, and H. V. Poor. Peer-to-peer trading in electricity networks: An overview. IEEE transactions on smart grid , 11(4):3185–3200, 2020

work page 2020
[46]

Yong and X

J. Yong and X. Y. Zhou. Stochastic controls: Hamiltonian systems and HJB equations , vol- ume 43. Springer Science & Business Media, 2012. Appendix A. Implementation of Algorithm 1 for Crowd-Motion Games To implement Algorithm 1, we uniformly discretize the time interval [0 , 1] into L = 50 steps. The batch size M, representing the number of simulated tra...

work page 2012

[1] [1]

Aghajani and A

A. Aghajani and A. Doustmohammadi. Formation control of multi-vehicle systems using co- operative game theory. In 2015 15th International Conference on Control, Automation and Systems (ICCAS), pages 704–709. IEEE, 2015

work page 2015

[2] [2]

Aurell, R

A. Aurell, R. Carmona, and M. Lauriere. Stochastic graphon games: II. the linear-quadratic case. Applied Mathematics & Optimization , 85(3):39, 2022

work page 2022

[3] [3]

Aurell and B

A. Aurell and B. Djehiche. Mean-field type modeling of nonlocal crowd aversion in pedestrian crowd dynamics. SIAM Journal on Control and Optimization , 56(1):434–455, 2018

work page 2018

[4] [4]

Barles, R

G. Barles, R. Buckdahn, and E. Pardoux. Backward stochastic differential equations and integral-partial differential equations. Stochastics: An International Journal of Probability and Stochastic Processes, 60(1-2):57–83, 1997

work page 1997

[5] [5]

A. Blum, E. Even-Dar, and K. Ligett. Routing without regret: On convergence to nash equilibria of regret-minimizing algorithms in routing games. Theory of Computing , 6(1):179– 199, 2010

work page 2010

[6] [6]

R. Carmona. Lectures on BSDEs, stochastic control, and stochastic differential games with financial applications. SIAM, 2016

work page 2016

[7] [7]

Carmona, Q

R. Carmona, Q. Cormier, and H. M. Soner. Synchronization in a kuramoto mean field game. Communications in Partial Differential Equations , 48(9):1214–1244, 2023

work page 2023

[8] [8]

Carmona and F

R. Carmona and F. Delarue. Probabilistic Theory of Mean Field Games with Applications I: Mean Field FBSDEs, Control, and Games , volume 83. Springer, 2018

work page 2018

[9] [9]

Colombo and D

A. Colombo and D. Del Vecchio. Efficient algorithms for collision avoidance at intersections. In Proceedings of the 15th ACM international conference on Hybrid Systems: Computation and Control, pages 145–154, 2012

work page 2012

[10] [10]

Cont and E

R. Cont and E. Voltchkova. A finite difference scheme for option pricing in jump diffusion and exponential L´ evy models.SIAM Journal on Numerical Analysis , 43(4):1596–1626, 2005. α-POTENTIAL DISTRIBUTED GAMES WITH JUMPS 21

work page 2005

[11] [11]

Dumitrescu, M.-C

R. Dumitrescu, M.-C. Quenez, and A. Sulem. A weak dynamic programming principle for combined optimal stopping/stochastic control with E f-expectations. SIAM Journal on Control and Optimization, 54(4):2090–2115, 2016

work page 2090

[12] [12]

Dumitrescu, M.-C

R. Dumitrescu, M.-C. Quenez, and A. Sulem. Mixed generalized Dynkin game and stochastic control in a Markovian framework. Stochastics, 89(1):400–429, 2017

work page 2017

[13] [13]

Dumitrescu, C

R. Dumitrescu, C. Reisinger, and Y. Zhang. Approximation schemes for mixed optimal stop- ping and control problems with nonlinear expectations and jumps. Applied Mathematics & Optimization, 83(3):1387–1429, 2021

work page 2021

[14] [14]

Giegrich, C

M. Giegrich, C. Reisinger, and Y. Zhang. Convergence of policy gradient methods for finite- horizon exploratory linear-quadratic control problems. SIAM Journal on Control and Opti- mization, 62(2):1060–1092, 2024

work page 2024

[15] [15]

X. Guo, X. Li, C. Maheshwari, S. Sastry, and M. Wu. Markov α-potential games: Equilibrium approximation and regret analysis. arXiv preprint arXiv:2305.12553 , 2023

work page arXiv 2023

[16] [16]

X. Guo, X. Li, and L. Zhang. Bsde approach for α-potential stochastic differential games. arXiv preprint arXiv:2507.13256 , 2025

work page arXiv 2025

[17] [17]

X. Guo, X. Li, and Y. Zhang. An α-potential game framework for N-player games. arXiv:2403.16962, 2024

work page arXiv 2024

[18] [18]

Guo and Y

X. Guo and Y. Zhang. Towards an analytical framework for dynamic potential games. SIAM Journal on Control and Optimization , 63(2):1213–1242, 2025

work page 2025

[19] [19]

R. Hu. Deep fictitious play for stochastic differential games. Communications in Mathematical Sciences, 19(2):325–353, 2021

work page 2021

[20] [20]

Jackson and D

J. Jackson and D. Lacker. Approximately optimal distributed stochastic controls beyond the mean field setting. arXiv preprint arXiv:2301.02901 , 2023

work page arXiv 2023

[21] [21]

E. R. Jakobsen and K. H. Karlsen. Continuous dependence estimates for viscosity solutions of integro-pdes. Journal of Differential Equations , 212(2):278–318, 2005

work page 2005

[22] [22]

Kalaria, C

D. Kalaria, C. Maheshwari, and S. Sastry. α-racer: Real-time algorithm for game-theoretic motion planning and control in autonomous racing using near-potential function.arXiv preprint arXiv:2412.08855, 2024

work page arXiv 2024

[23] [23]

Kavuncu, A

T. Kavuncu, A. Yaraneri, and N. Mehr. Potential iLQR: A potential-minimizing controller for planning multi-agent interactive trajectories. arXiv preprint arXiv:2107.04926 , 2021

work page arXiv 2021

[24] [24]

D. P. Kingma and J. Ba. Adam: A method for stochastic optimization, 2017

work page 2017

[25] [25]

Krichene, W

S. Krichene, W. Krichene, R. Dong, and A. Bayen. Convergence of heterogeneous distributed learning in stochastic routing games. In 2015 53rd Annual Allerton Conference on Communi- cation, Control, and Computing (Allerton) , pages 480–487. IEEE, 2015

work page 2015

[26] [26]

H. Kunita. Stochastic Differential Equations Based on L´ evy Processes and Stochastic Flows of Diffeomorphisms, pages 305–373. Birkh¨ auser Boston, Boston, MA, 2004

work page 2004

[27] [27]

Lachapelle and M.-T

A. Lachapelle and M.-T. Wolfram. On a mean field game approach modeling congestion and aversion in pedestrian crowds. Transportation research part B: methodological , 45(10):1572– 1589, 2011

work page 2011

[28] [28]

L. Lu, R. Hu, X. Yang, and Y. Zhu. Multiagent relative investment games in a jump diffusion market with deep reinforcement learning algorithm. SIAM Journal on Financial Mathematics , 16(2):707–746, 2025

work page 2025

[29] [29]

Z. Ma, D. S. Callaway, and I. A. Hiskens. Decentralized charging control of large populations of plug-in electric vehicles. IEEE Transactions on control systems technology , 21(1):67–78, 2011

work page 2011

[30] [30]

Maheshwari, M

C. Maheshwari, M. Wu, and S. Sastry. Convergence of decentralized actor-critic algorithm in general-sum markov games. IEEE Control Systems Letters , 2024

work page 2024

[31] [31]

Mazumdar, L

E. Mazumdar, L. J. Ratliff, M. I. Jordan, and S. S. Sastry. Policy-gradient algorithms have no guarantees of convergence in linear quadratic games. In Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems , pages 860–868, 2020. 22 α-POTENTIAL DISTRIBUTED GAMES WITH JUMPS

work page 2020

[32] [32]

Narasimha, K

D. Narasimha, K. Lee, D. Kalathil, and S. Shakkottai. Multi-agent learning via markov poten- tial games in marketplaces for distributed energy resources. In 2022 IEEE 61st Conference on Decision and Control (CDC) , pages 6350–6357. IEEE, 2022

work page 2022

[33] [33]

Nourian, P

M. Nourian, P. E. Caines, and R. P. Malham´ e. Mean field analysis of controlled Cucker- Smale type flocking: Linear analysis and perturbation equations. IFAC Proceedings Volumes, 44(1):4471–4476, 2011

work page 2011

[34] [34]

Paccagnan, M

D. Paccagnan, M. Kamgarpour, and J. Lygeros. On aggregative and mean field games with applications to electricity markets. In 2016 European Control Conference (ECC) , pages 196–

work page 2016

[35] [35]

H. Pham. Optimal stopping of controlled jump diffusion processes: a viscosity solution ap- proach. J. Math. Syst. Estimat. Control , 8(1):1, 1998

work page 1998

[36] [36]

Ramchurn, P

S. Ramchurn, P. Vytelingum, A. Rogers, and N. Jennings. Agent-based control for decentralised demand side management in the smart grid. 2011

work page 2011

[37] [37]

Reisinger, W

C. Reisinger, W. Stockinger, and Y. Zhang. Linear convergence of a policy gradient method for some finite horizon continuous time control problems. SIAM Journal on Control and Op- timization, 61(6):3526–3558, 2023

work page 2023

[38] [38]

Reisinger and Y

C. Reisinger and Y. Zhang. A penalty scheme and policy iteration for nonlocal HJB varia- tional inequalities with monotone nonlinearities. Computers & Mathematics with Applications , 93:199–213, 2021

work page 2021

[39] [39]

Santambrogio and W

F. Santambrogio and W. Shim. A cucker–smale inspired deterministic mean field game with velocity interactions. SIAM Journal on Control and Optimization , 59(6):4155–4187, 2021

work page 2021

[40] [40]

Sethi, D

D. Sethi, D. ˇSiˇ ska, and Y. Zhang. Entropy annealing for policy mirror descent in continuous time and space. arXiv preprint arXiv:2405.20250 , 2024

work page arXiv 2024

[41] [41]

Srikantha and D

P. Srikantha and D. Kundur. Resilient distributed real-time demand response via population games. IEEE Transactions on Smart Grid , 8(6):2532–2543, 2016

work page 2016

[42] [42]

Sun, P.-Y

L. Sun, P.-Y. Hung, C. Wang, M. Tomizuka, and Z. Xu. Distributed multi-agent interaction generation with imagined potential games. arXiv preprint arXiv:2310.01614 , 2023

work page arXiv 2023

[43] [43]

L. Sun, Y. Wang, P.-Y. Hung, C. Wang, X. Zhang, Z. Xu, and M. Tomizuka. Imagined potential games: A framework for simulating, learning and evaluating interactive behaviors. arXiv preprint arXiv:2411.03669 , 2024

work page arXiv 2024

[44] [44]

Tordeux, M

A. Tordeux, M. Chraibi, and A. Seyfried. Collision-free speed model for pedestrian dynamics. In Traffic and Granular Flow’15, pages 225–232. Springer, 2016

work page 2016

[45] [45]

Tushar, T

W. Tushar, T. K. Saha, C. Yuen, D. Smith, and H. V. Poor. Peer-to-peer trading in electricity networks: An overview. IEEE transactions on smart grid , 11(4):3185–3200, 2020

work page 2020

[46] [46]

Yong and X

J. Yong and X. Y. Zhou. Stochastic controls: Hamiltonian systems and HJB equations , vol- ume 43. Springer Science & Business Media, 2012. Appendix A. Implementation of Algorithm 1 for Crowd-Motion Games To implement Algorithm 1, we uniformly discretize the time interval [0 , 1] into L = 50 steps. The batch size M, representing the number of simulated tra...

work page 2012