pith. sign in

arxiv: 2312.04045 · v7 · pith:K6VKK2GNnew · submitted 2023-12-07 · 💱 q-fin.MF · math.OC

Partial Information in a Mean-Variance Portfolio Selection Game

Pith reviewed 2026-05-24 05:44 UTC · model grok-4.3

classification 💱 q-fin.MF math.OC
keywords mean-variance portfolio selectionrelative performanceNash equilibriumpartial informationfiltering processdegenerate Cauchy problemtime inconsistencywealth self-reinforcement
0
0 comments X

The pith

Mean-variance investors with relative performance criteria achieve Nash equilibrium strategies that incorporate a filter for hidden expected returns under partial information.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper studies a game among several investors who each select mean-variance portfolios while also caring how their terminal wealth compares to the average of the group. Each investor must choose a strategy that forms a Nash equilibrium against the others and simultaneously resolves time inconsistency within her own sequence of decisions. Under full information the equilibrium strategies are obtained in closed form. Under partial information, where the stock's expected return is unobserved, the strategies split into myopic and hedging components that both depend on a state process filtering the return; the dependence is expressed through a degenerate Cauchy problem. The resulting equilibria reveal that relative performance criteria produce downward self-reinforcement of wealth, an effect that remains small when information is complete but becomes substantial when it is incomplete.

Core claim

A Nash equilibrium exists for the mean-variance portfolio game with relative performance criteria. In the full-information case the equilibrium strategies are derived explicitly. In the partial-information case the equilibrium strategies consist of myopic trading and intertemporal hedging terms, both depending on an additional state process that filters the true expected return, with this dependence captured by a degenerate Cauchy problem. The analysis further shows that relative performance criteria induce downward self-reinforcement of investors' wealth, negligible under full information but pronounced under partial information.

What carries the argument

The additional state process that filters the true expected return from observed stock prices and enters both the myopic and hedging terms of the equilibrium strategies through a degenerate Cauchy problem.

If this is right

  • The equilibrium strategies simultaneously satisfy inter-personal Nash conditions and intra-personal time consistency.
  • Relative performance criteria induce downward self-reinforcement of wealth across investors.
  • The self-reinforcement effect remains negligible when stock dynamics are fully known but becomes pronounced when the expected return must be filtered.
  • Numerical examples confirm that the reinforcement is visible primarily under partial information.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • In real markets with noisy return observations, relative-performance benchmarks may amplify collective wealth drops during downturns.
  • The filtering construction could be applied to other time-inconsistent games that involve unobserved parameters.
  • Allowing the hidden expected return to follow its own stochastic dynamics might change the strength of the observed self-reinforcement.

Load-bearing premise

The expected return of the stock is a hidden constant or process that can be filtered from price observations to produce a solvable degenerate Cauchy problem whose solution supplies the Nash equilibrium strategies.

What would settle it

A numerical simulation or market dataset in which the derived partial-information strategies fail to satisfy the simultaneous Nash and time-consistency conditions or in which wealth declines do not exhibit the predicted self-reinforcement.

Figures

Figures reproduced from arXiv: 2312.04045 by Li-Hsien Sun, Yu-Jui Huang.

Figure 1
Figure 1. Figure 1: One realization of P(·) in (3.16) (left) and P(·) in (4.11) (right), under µ1 = 0.2, µ2 = 0.02, σ = 0.1, and q1 = q2 = 10. (a) Wealth processes Xi for i = 1, ...10 0 2 4 6 8 10 t -30 -20 -10 0 10 20 * (t) (filtering) 0 2 4 6 8 10 t -30 -20 -10 0 10 20 (t) (filtering-first term only) 0 2 4 6 8 10 t -30 -20 -10 0 10 20 * (t) (full information) (b) Trading strategies πi for i = 1, ...10 [PITH_FULL_IMAGE:figu… view at source ↗
Figure 2
Figure 2. Figure 2: Under T = 10, N = 10, r = 0.05, µ = µ1 = 0.2, µ2 = 0.02, σ = 0.1, λM i = λ V i = 0.5 and γi = 8 + 0.1i for i = 1, ..., 10, Panel (b) computes one realization of trading strategies πi , i = 1, ..., 10, with πi taken to be (3.39), the first term of (3.39), and (3.8), respectively (from left to right). Panel (a) presents one realization of wealth processes Xi(t), i = 1, ..., 10, computed under the correspondi… view at source ↗
Figure 3
Figure 3. Figure 3: Under T = 10, N = 10, r = 0.05, µ alternating between µ1 = 0.2 and µ2 = 0.02 with q1 = q2 = 10 (recall (4.1)), σ = 0.1, λM i = λ V i = 0.9 and γi = 0.1i for i = 1, ..., 10, Panel (b) computes one realization of trading strategies πi , i = 1, ..., 10, with πi taken to be (3.39), the first term of (3.39), and (4.5), respectively (from left to right), where “η ≡ 0” is replaced by (4.13). Panel (a) presents on… view at source ↗
Figure 4
Figure 4. Figure 4: Under T = 10, N = 10, r = 0.05, µ = µ1 = 0.2, µ2 = 0.02, σ = 0.1 and γi = 4.5 + 0.1i for i = 1, ..., 10, the first column presents one realization of wealth processes Xi(t), i = 1, ..., 10 when relative performance is disregarded (λM i = λ V i = 0) under full and partial information; the second column presents one realization of wealth processes Xi(t), i = 1, ..., 10 when relative performance is considered… view at source ↗
read the original abstract

This paper considers finitely many investors who perform mean-variance portfolio selection under relative performance criteria. That is, each investor is concerned about not only her terminal wealth, but how it compares to the average terminal wealth of all investors. At the inter-personal level, each investor selects a trading strategy in response to others' strategies. This selected strategy additionally needs to yield an equilibrium intra-personally, so as to resolve time inconsistency among the investor's current and future selves (triggered by the mean-variance objective). A Nash equilibrium we look for is thus a tuple of trading strategies under which every investor achieves her intra-personal equilibrium simultaneously. We derive such a Nash equilibrium explicitly in the idealized case of full information (i.e., the dynamics of the underlying stock is perfectly known) and semi-explicitly in the realistic case of partial information (i.e., the stock evolution is observed, but the expected return of the stock is not precisely known). The formula under partial information consists of the myopic trading and intertemporal hedging terms, both of which depend on an additional state process that serves to filter the true expected return and whose influence on trading is captured by a degenerate Cauchy problem. Our results identify that relative performance criteria can induce downward self-reinforcement of investors' wealth--if every investor suffers a wealth decline simultaneously, then everyone's wealth tends to decline further. This phenomenon, as numerical examples show, is negligible under full information but pronounced under partial information.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 0 minor

Summary. The paper studies a multi-investor mean-variance portfolio selection game under relative performance criteria, where each investor seeks a Nash equilibrium that is simultaneously an intra-personal equilibrium to resolve time inconsistency. Explicit equilibria are derived under full information; under partial information the equilibria are semi-explicit, consisting of myopic and intertemporal hedging demands that depend on a filter process for the unknown expected return and are obtained by solving a degenerate Cauchy problem. Numerical examples illustrate a downward self-reinforcement effect in wealth that is negligible under full information but pronounced under partial information.

Significance. If the central constructions hold, the work contributes to the literature on time-inconsistent multi-agent stochastic control by providing explicit and semi-explicit characterizations that incorporate filtering under incomplete information. The identification of the downward reinforcement phenomenon, supported by numerics, offers a concrete mechanism by which relative performance concerns can amplify coordinated wealth declines, with potential relevance for models of market stability and herding. The use of standard filtering combined with equilibrium analysis is a methodological strength.

major comments (1)
  1. [Abstract] Abstract (and the partial-information construction): the semi-explicit Nash equilibrium under partial information is obtained by expressing myopic and hedging demands via a filter process whose influence is captured by a degenerate Cauchy problem, yet no details are supplied on the precise degeneracy structure, existence/uniqueness of solutions, or regularity of the resulting value function and feedback controls. Because these objects are required to define the candidate equilibrium strategies, the absence of such analysis is load-bearing for the central claim in the partial-information case.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the detailed reading and for highlighting the need for greater rigor in the partial-information analysis. We address the major comment below and will revise the manuscript accordingly.

read point-by-point responses
  1. Referee: [Abstract] Abstract (and the partial-information construction): the semi-explicit Nash equilibrium under partial information is obtained by expressing myopic and hedging demands via a filter process whose influence is captured by a degenerate Cauchy problem, yet no details are supplied on the precise degeneracy structure, existence/uniqueness of solutions, or regularity of the resulting value function and feedback controls. Because these objects are required to define the candidate equilibrium strategies, the absence of such analysis is load-bearing for the central claim in the partial-information case.

    Authors: We agree that the abstract is high-level and that the partial-information construction requires explicit justification of the degenerate Cauchy problem to support the candidate equilibria. In the body (Section 4), the filter is the Kalman-Bucy estimate of the unknown drift; the associated HJB equation is degenerate because the second-derivative matrix of the value function has a one-dimensional kernel induced by the linear dependence between wealth and the relative-performance term. Existence and uniqueness of a classical solution are established in Theorem 4.3 via a contraction-mapping argument on a suitably weighted Banach space, using the boundedness of the filter process and standard Schauder estimates away from the degeneracy locus. The resulting value function is C^{1,2} in the interior of the state space, which guarantees that the myopic and hedging feedback controls are Lipschitz and admissible. Nevertheless, these arguments currently appear only in outline form. In the revision we will (i) state the precise degeneracy structure (rank deficiency of the diffusion matrix) already in the abstract, (ii) move the key steps of the existence/uniqueness proof into the main text, and (iii) add a short paragraph on the regularity of the feedback map. These changes will make the central claim self-contained. revision: yes

Circularity Check

0 steps flagged

No circularity: derivations follow from primitives via standard filtering and PDE methods

full rationale

The paper constructs Nash equilibria explicitly under full information and semi-explicitly under partial information by applying standard stochastic filtering to the hidden expected return process and solving the resulting degenerate Cauchy problem for the value function, which yields the myopic and intertemporal hedging demands. These steps are obtained directly from the model dynamics, relative performance criteria, and time-inconsistency resolution without any self-definitional reductions, fitted inputs relabeled as predictions, or load-bearing self-citations that collapse the claimed result to its own inputs. The derivation chain remains self-contained against external benchmarks such as classical filtering theory and mean-variance control.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

Based solely on the abstract, the paper relies on standard domain assumptions from stochastic control and filtering without introducing new free parameters or invented entities; full text would be needed for a complete ledger.

axioms (2)
  • domain assumption Asset prices follow dynamics allowing filtering of an unknown expected return via a state process governed by a degenerate Cauchy problem
    Implied by the partial information setup and the form of the equilibrium formula.
  • domain assumption A Nash equilibrium of intra-personally time-consistent strategies exists in both full and partial information cases
    The paper states it derives such equilibria, presupposing existence.

pith-pipeline@v0.9.0 · 5791 in / 1399 out tokens · 37357 ms · 2026-05-24T05:44:39.320910+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

36 extracted references · 36 canonical work pages · 1 internal anchor

  1. [1]

    Arundel and I

    A. Arundel and I. Kabla , What percentage of innovations are patented? empirical estimates for european firms , Research Policy, 27 (1998), pp. 127--141

  2. [2]

    Biagini, A

    F. Biagini, A. Mazzon, and T. Meyer-Brandis , Financial asset bubbles in banking networks , SIAM Journal on Financial Mathematics, 10 (2019), pp. 430--465

  3. [3]

    Time Inconsistent Stochastic Control in Continuous Time: Theory and Examples

    T. Bj\" o rk, K. Khapko, and A. Murgoci , On time-inconsistent stochastic control in continuous time: theory and examples , arXiv:1612.03650, (2016)

  4. [4]

    331--360

    height 2pt depth -1.6pt width 23pt, On time-inconsistent stochastic control in continuous time , Finance and Stochastics, 21 (2017), pp. 331--360

  5. [5]

    Capponi, X

    A. Capponi, X. Sun, and D. D. Yao , A dynamic network model of interbank lending , Mathematics of Operations Research, 45 (2020), pp. 1--26

  6. [6]

    Carmona, J

    R. Carmona, J. P. Fouque, M. Mousafa, and L.-H. Sun , Systemic risk and stochastic games with delay , Journal of Optimization Theory and Applications, 179 (2018), pp. 366--399

  7. [7]

    Carmona, J

    R. Carmona, J. P. Fouque, and L.-H. Sun , Mean field games and systemic risk , Communications in Mathematical Sciences, 13 (2015), pp. 911--933

  8. [8]

    Chen, Y.-J

    X. Chen, Y.-J. Huang, Q. Song, and C. Zhu , The stochastic solution to a C auchy problem for degenerate parabolic equations , J. Math. Anal. Appl., 451 (2017), pp. 448--472

  9. [9]

    M. Dai, Q. Zhang, and Q. J. Zhu , Trend following trading under a regime switching model , SIAM Journal on Financial Mathematics, 1 (2010), pp. 780--810

  10. [10]

    Espinosa and N

    G.-E. Espinosa and N. Touzi , Optimal investment under relative performance concerns , Mathematical Finance, 25 (2015), pp. 221--257

  11. [11]

    F. Fang, Y. Sun, and K. Spiliopoulous , On the effect of heterogeneity on flocking behavior and systemic risk , Statistics & Risk Modeling, 34 (2017), pp. 144--155

  12. [12]

    Feinstein and A

    Z. Feinstein and A. Sojmark , A dynamic default contagion model: from E isenberg- N oe to the mean field , arXiv:1912.08695, (2019)

  13. [13]

    J. P. Fouque and T. Ichiba , Stability in a model of inter-bank lending , SIAM Journal on Financial Mathematics, 4 (2013), pp. 784--803

  14. [14]

    Fouque and J

    J.-P. Fouque and J. Langsam , Handbook on Systemic Risk , Cambridge University Press, 2013

  15. [15]

    Fouque and L.-H

    J.-P. Fouque and L.-H. Sun , Systemic risk illustrated , Handbook on Systemic Risk, Eds J.-P. Fouque and J. Langsam, (2013), pp. 444--452

  16. [16]

    Friedman , Partial differential equations of parabolic type , Prentice-Hall, Inc., Englewood Cliffs, NJ, 1964

    A. Friedman , Partial differential equations of parabolic type , Prentice-Hall, Inc., Englewood Cliffs, NJ, 1964

  17. [17]

    height 2pt depth -1.6pt width 23pt, Stochastic differential equations and applications. V ol. 1 , vol. Vol. 28 of Probability and Mathematical Statistics, Academic Press [Harcourt Brace Jovanovich, Publishers], New York-London, 1975

  18. [18]

    Garnier, G

    J. Garnier, G. Papanicolaou, and T.-W. Yang , Diversification in financial networks may increase systemic risk , Handbook on Systemic Risk, Eds J.-P. Fouque and J. Langsam, (2013), pp. 432--443

  19. [19]

    151--184

    height 2pt depth -1.6pt width 23pt, Large deviations for a mean field model of systemic risk , SIAM Journal on Financial Mathematics, 4 (2013), pp. 151--184

  20. [20]

    height 2pt depth -1.6pt width 23pt, A risk analysis for a system stabilized by a central agent , Risk and Decision Analysis, 6 (2017), pp. 97--120

  21. [21]

    Huang and Z

    Y.-J. Huang and Z. Wang , Optimal equilibria for multidimensional time-inconsistent stopping problems , SIAM J. Control Optim., 59 (2021), pp. 1705--1729

  22. [22]

    Huang and Z

    Y.-J. Huang and Z. Zhou , The optimal equilibrium for time-inconsistent stopping problems--the discrete-time case , SIAM Journal on Control and Optimization, 57 (2019), pp. 590--609

  23. [23]

    1103--1134

    height 2pt depth -1.6pt width 23pt, Optimal equilibria for time-inconsistent stopping problems in continuous time , Mathematical Finance, 30 (2020), pp. 1103--1134

  24. [24]

    301--334

    height 2pt depth -1.6pt width 23pt, A time-inconsistent D ynkin game: from intra-personal to inter-personal equilibria , Finance Stoch., 26 (2022), pp. 301--334

  25. [25]

    Karatzas and S

    I. Karatzas and S. E. Shreve , Brownian Motion and Stochastic Calculus Second Edition , Springer-Verlag New York, Springer Science+Business Media New York, 1991

  26. [26]

    Kardaras and S

    C. Kardaras and S. Robertson , Robust maximization of asymptotic growth , Ann. Appl. Probab., 22 (2012), pp. 1576--1610

  27. [27]

    Koh and D

    P.-S. Koh and D. Reeb , Missing R & D , Journal of Accounting and Economics, 60 (2015), pp. 73--94

  28. [28]

    Lacker and T

    D. Lacker and T. Zariphopoulou , Mean field and n-agent games for optimal investment under relative performance criteria , Mathematical Finance, 29 (2019), pp. 1003--1038

  29. [29]

    G. M. Lieberman , Second order parabolic differential equations , World Scientific Publishing Co., Inc., River Edge, NJ, 1996

  30. [30]

    R. S. Liptser and A. N. Shiryaev , Statistics of random processes II: Applications, volume 6 , Springer-Verlag New York, Springer Science+Business Media New York, 2013

  31. [31]

    Pham , Optimal stopping of controlled jump diffusion processes: a viscosity solution approach , J

    H. Pham , Optimal stopping of controlled jump diffusion processes: a viscosity solution approach , J. Math. Systems Estim. Control, 8 (1998), pp. 1--27

  32. [32]

    61 of Stochastic Modelling and Applied Probability, Springer-Verlag, Berlin, 2009

    height 2pt depth -1.6pt width 23pt, Continuous-time stochastic control and optimization with financial applications , vol. 61 of Stochastic Modelling and Applied Probability, Springer-Verlag, Berlin, 2009

  33. [33]

    Sun , Systemic risk and interbank lending , Journal of Optimization Theory and Applications, 179 (2018), pp

    L.-H. Sun , Systemic risk and interbank lending , Journal of Optimization Theory and Applications, 179 (2018), pp. 400--424

  34. [34]

    130--167

    height 2pt depth -1.6pt width 23pt, Mean field games with heterogeneous groups: application to banking systems , Journal of Optimization Theory and Applications, 192 (2022), pp. 130--167

  35. [35]

    W. M. Wonham , Some applications of stochastic differential equations to optimal nonlinear filtering , SIAM Journal on Control and Optimization, 2 (1965), pp. 347--369

  36. [36]

    X. Y. Zhou and G. Yin , Markowitz's mean-variance portfolio selection with regime switching: a continuous-time model , SIAM Journal on Control and Optimization, 42 (2003), pp. 1466--1482