pith. sign in

arxiv: 2606.24562 · v1 · pith:GLMJPAT6new · submitted 2026-06-23 · 🧮 math.CO · math.PR· q-bio.PE

A parameterized family of balance indices for phylogenetic networks

Pith reviewed 2026-06-25 23:11 UTC · model grok-4.3

classification 🧮 math.CO math.PRq-bio.PE
keywords phylogenetic networksbalance indicesgrafting propertyB2 indexSackin indexrandom phylogenetic treesGalton-Watson treesYule model
0
0 comments X

The pith

The H_α family extends the B2 balance index to phylogenetic networks via a grafting property that decomposes the index across biconnected components.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper defines a one-parameter family of balance indices H_α for phylogenetic networks, with α a positive real number. When α equals 1 the family recovers the B2 index; other values supply a direct generalization of the Sackin index. The central technical result is that every H_α satisfies a grafting property, so the index of any network equals a simple function of the indices of its biconnected components. The property is then used to locate the networks that minimize and maximize H_α inside several natural classes and to compute exact distributions under standard random models, including the Yule and PDA models. Local-limit arguments further yield asymptotic moment formulas for large networks constructed as blowups of Galton-Watson trees.

Core claim

The H_α indices are defined for phylogenetic networks and inherit the grafting property from the B2 index, which expresses the index of the whole network in terms of the indices of its biconnected components. This enables results on extrema and distributions for various network classes and random models.

What carries the argument

The grafting property, which allows expressing the H_α index of a network in terms of the H_α indices of its biconnected components.

If this is right

  • Minimizers and maximizers of H_α exist and can be characterized inside each fixed class of phylogenetic networks.
  • Exact distributions of H_α are obtainable for Galton-Watson trees and binary Markov branching trees, with explicit formulas under the Yule and PDA models.
  • Local-limit techniques produce the asymptotic growth rate and limiting moments of H_α on large random networks.
  • Moment formulas hold for the broader class of blowups of Galton-Watson trees.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Different choices of α may weight local versus global imbalance differently, offering a tunable diagnostic for network shape.
  • The same grafting decomposition could be tested on balance indices already defined for directed or weighted networks.
  • If the grafting property survives mild relaxations of the network definition, the same extremal and distributional results would apply to a larger set of biological graphs.

Load-bearing premise

The grafting property holds for every member of the H_α family.

What would settle it

An explicit phylogenetic network together with a value of α for which the H_α value cannot be recovered from the H_α values of its biconnected components.

Figures

Figures reproduced from arXiv: 2606.24562 by Fran\c{c}ois Bienvenu, Hadrien Maffioli, Jean-Jil Duchamps.

Figure 1
Figure 1. Figure 1: Two binary trees T and T ′ such that Hα(T) − Hα(T ′ ) changes sign as α varies. The following proposition is the key to our study of the Hα index, as it makes it possible to express the Hα index of a network in terms of the Hα indices of its biconnected components, thereby opening the door to recursive computations. Proposition 1.5 (Grafting property). Let G1 and G2 be two phylogenetic networks. Let G be t… view at source ↗
Figure 2
Figure 2. Figure 2: Examples of phylogenetic trees and networks of special interest. [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Representation of the blowup procedure. On the left: [PITH_FULL_IMAGE:figures/full_fig_p018_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Powers of the selected balance indices, when comparing each alternative model to the Yule [PITH_FULL_IMAGE:figures/full_fig_p021_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Powers of the selected balance indices, when comparing each alternative model to the [PITH_FULL_IMAGE:figures/full_fig_p022_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Comparison between the power of different indices as a function of the number of leaves. [PITH_FULL_IMAGE:figures/full_fig_p022_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: A case where the power decreases in α and a case where it first increases. 0.0 0.5 1.0 1.5 2.0 2.5 0.0 0.2 0.4 0.6 0.8 1.0 P o w er of th e H in d e x (a) The null model is Inherited fertility and the alter￾native is Simple Brownian (both parameters equal 1). The number of leaves of each simulated tree is 30. 0.0 0.5 1.0 1.5 2.0 2.5 0.0 0.2 0.4 0.6 0.8 1.0 P o w er of th e H in d e x (b) Same models but th… view at source ↗
Figure 8
Figure 8. Figure 8: Some cases where the optimal α is not constant for the number of leaves. 23 [PITH_FULL_IMAGE:figures/full_fig_p023_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: The null model is β-splitting with β = −1 and the alternative is Simple Brownian (with parameter 1). The number of leaves of each simulated tree is 30. The power increases with α. To summarize, the results obtained here are in the continuity of the conclusions of Kersting et al. in [KWF25]: it seems that there is no ”best” index generally speaking and that performance depends on the choice of the null and … view at source ↗
Figure 10
Figure 10. Figure 10: Since the simple random walk “escapes to infinity” along either the left or the right infinite path (call such infinite paths rays), the Hα index of this network should be the same as that of the cherry – namely the structural α-entropy of the probability distribution (1/2, 1/2). In this simple example, the boundary is clear: it should consist of two “points at infinity” that correspond to each of the two… view at source ↗
Figure 11
Figure 11. Figure 11: Insertion of an edge from ⃗uw to ⃗uv (left) and from ⃗uv to ⃗uw (right). Proof: Letting pℓ, p ′ ℓ and p ′′ ℓ be the probabilities of reaching a fixed leaf ℓ in N, N′ and N′′, respec￾tively, we have p ′ ℓ + p ′′ ℓ = 2pℓ. Using the concavity of fα and summing these inequalities over the set of leaves yields Hα(N ′ ) + Hα(N ′′) ⩽ 2Hα(N), and the equality case follows from the strict concavity of fα. ■ 34 [P… view at source ↗
Figure 12
Figure 12. Figure 12: By “cutting” τ at each of the k − 1 vertices inside the path from the root to ℓ, we obtain k ordered binary trees such that, for each of these trees, one of the two children of the root is distinguished (namely, the child that corresponds to the next vertex on the path from the root of τ to ℓ) and is a leaf: see [PITH_FULL_IMAGE:figures/full_fig_p044_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Top row, the k trees obtained by “cutting” τ at each of the k internal vertices on the path from the root to ℓ. Note that, for each of these trees, one child of the root is distinguished (materialized by the bold edge), and that this child is a leaf. However, this distinguished child can be the left or the right child of the root. Bottom row, the k trees obtained after “flipping” some of them horizontally… view at source ↗
Figure 14
Figure 14. Figure 14: Top row, the Dyck paths associated to the trees in the bottom row of Figure [PITH_FULL_IMAGE:figures/full_fig_p045_14.png] view at source ↗
Figure 15
Figure 15. Figure 15: A counter example to the non lower semi-continuity. [PITH_FULL_IMAGE:figures/full_fig_p046_15.png] view at source ↗
Figure 16
Figure 16. Figure 16: A counter example to the non upper semi-continuity. [PITH_FULL_IMAGE:figures/full_fig_p047_16.png] view at source ↗
read the original abstract

We introduce a new family of balance indices for phylogenetic networks: the $H_\alpha$ indices, where $\alpha$ is a positive real number. This family includes the $B_2$ index as a special case ($\alpha = 1$) and provides a natural extension of the Sackin index to phylogenetic networks. We show that the $H_\alpha$ indices share many structural properties with the $B_2$ index, most notably a "grafting property" that makes it possible to express the $H_\alpha$ index of a network in terms of the $H_\alpha$ indices of its biconnected components. These properties allow us to identify networks that minimize / maximize $H_\alpha$ for various classes of phylogenetic networks, and to study its distribution for several models of random trees and networks (in particular, Galton-Watson trees and binary Markov branching trees, with a focus on the Yule and PDA models). Finally, we show how local limits can be used to analyze the asymptotic behavior of $H_\alpha$ for large trees and networks, and we obtain general results for the moments of $H_\alpha$ for a broad class of random phylogenetic networks known as blowups of Galton-Watson trees.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 2 minor

Summary. The paper introduces the H_α family of balance indices for phylogenetic networks, parameterized by a positive real α. This recovers the B2 index at α=1 and extends the Sackin index to networks. The central structural result is a grafting property that decomposes the index over biconnected components. This property is used to characterize networks minimizing or maximizing H_α in various classes, to study its distribution under Galton-Watson trees, binary Markov branching trees (with emphasis on Yule and PDA models), and to derive asymptotic results including moments via local limits for blowups of Galton-Watson trees.

Significance. If the grafting property and derived results hold, the work supplies a tunable family of indices with clean decomposition properties that facilitate extremal and distributional analysis on networks. Strengths include the explicit recovery of known indices as special cases, the use of local limits for asymptotics, and the focus on standard random models (Yule, PDA). These features could support further applications in phylogenetics.

minor comments (2)
  1. [Abstract] The abstract refers to 'several models of random trees and networks' but then specifies only Galton-Watson and binary Markov branching; a short clarifying sentence would help.
  2. Notation for the grafting property could be introduced with a small illustrative figure or example network in the section where it is first stated.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their careful reading and positive evaluation of the manuscript. We are pleased that the grafting property, the recovery of known indices, and the results on extremal values and distributional properties under standard random models were viewed favorably. The recommendation to accept is appreciated.

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper defines the H_α family directly from a parameterized formula that reduces to the known B2 index at α=1 by algebraic substitution, then derives the grafting property as a theorem from that explicit definition. All subsequent results on minima, maxima, distributions, and asymptotics are obtained from the grafting decomposition and standard probabilistic arguments on the cited random models; no step renames a fitted quantity as a prediction, imports uniqueness via self-citation, or treats an ansatz as an external theorem. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the definition of H_α via a positive real parameter alpha and the domain assumption that phylogenetic networks admit a well-defined notion of biconnected components and grafting. No invented entities are introduced.

free parameters (1)
  • alpha
    Positive real parameter that defines each member of the family; chosen by the user rather than fitted to data.
axioms (1)
  • domain assumption Phylogenetic networks have well-defined biconnected components that support a grafting decomposition.
    Invoked to establish the grafting property that underpins all subsequent min/max and distributional results.

pith-pipeline@v0.9.1-grok · 5765 in / 1402 out tokens · 26021 ms · 2026-06-25T23:11:28.782473+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

64 extracted references · 40 canonical work pages · 1 internal anchor

  1. [1]

    Yet Another Proof of the

    Artstein, Zvi , year = 1990, journal =. Yet Another Proof of the

  2. [2]

    Sophie Kersting and Kristina Wicke and Mareike Fischer , year =. powe

  3. [3]

    2021 , note =

    treebalance: computation of (im)balance indices , author =. 2021 , note =

  4. [4]

    and Tsikouras, P

    Sapounakis, A. and Tsikouras, P. and Tasoulas, I. and Manes, K. , title =. The Electronic Journal of Combinatorics , year =

  5. [5]

    Graphs and Combinatorics , year =

    Ferrari, Luca , title =. Graphs and Combinatorics , year =

  6. [6]

    2026 , howpublished =

  7. [7]

    , title =

    Guyer, Craig and Slowinski, Joseph B. , title =. Evolution , year =

  8. [8]

    Udny , title =

    Yule, G. Udny , title =. Philosophical Transactions of the Royal Society of London. Series B, Containing Papers of a Biological Character , year =

  9. [9]

    , title =

    Harris, Theodore E. , title =. 1963 , publisher =

  10. [10]

    Harding, E. F. , title =. Advances in Applied Probability , year =

  11. [11]

    2007 , isbn =

    Stochastic Orders , editor =. 2007 , isbn =

  12. [12]

    Sokal , title =

    Kwang-Tsao Shao and Robert R. Sokal , title =. Systematic Zoology , volume =. 1990 , doi =

  13. [13]

    Hurlbert , title =

    Stuart H. Hurlbert , title =. Ecology , volume =. 1971 , doi =

  14. [14]

    E. H. Simpson , title =. Nature , volume =. 1949 , doi =

  15. [15]

    The weighted total cophenetic index: A novel balance index for phylogenetic networks , journal =

    Linda Kn. The weighted total cophenetic index: A novel balance index for phylogenetic networks , journal =. 2024 , doi =

  16. [16]

    Local limits of conditioned Galton-Watson trees: the infinite spine case , shorttitle =

    Abraham, Romain and Delmas, Jean-François , date =. Local limits of conditioned Galton-Watson trees: the infinite spine case , shorttitle =. doi:10.1214/EJP.v19-2747 , issue =

  17. [17]

    On measures of information and their characterizations , author =

  18. [18]

    Power of Eight Tree Shape Statistics to Detect Nonrandom Diversification: A Comparison by Simulation of Two Models of Cladogenesis , shorttitle =

    Agapow, Paul-Michael and Purvis, Andy , date =. Power of Eight Tree Shape Statistics to Detect Nonrandom Diversification: A Comparison by Simulation of Two Models of Cladogenesis , shorttitle =. doi:10.1080/10635150290102564 , file =

  19. [19]

    Michael Steele

    The Objective Method: Probabilistic Combinatorial Optimization and Local Weak Convergence , shorttitle =. Probability on Discrete Structures , author =. doi:10.1007/978-3-662-09444-0_1 , isbn =

  20. [20]

    Probability Distributions on Cladograms , booktitle =

    Aldous, David , editor =. Probability Distributions on Cladograms , booktitle =. doi:10.1007/978-1-4612-0719-1_1 , isbn =

  21. [21]

    doi:10.1214/ss/998929474 , file =

    Stochastic Models and Descriptive Statistics for Phylogenetic Trees, from Yule to Today , author =. doi:10.1214/ss/998929474 , file =

  22. [22]

    The Continuum Random Tree

    Aldous, David , year = 1993, journal =. The Continuum Random Tree

  23. [23]

    Invariance Principles for

    Kortchemski, Igor , year = 2012, journal =. Invariance Principles for

  24. [24]

    Recurrence of Distributional Limits of Finite Planar Graphs , booktitle =

    Benjamini, Itai and Schramm, Oded , editor =. Recurrence of Distributional Limits of Finite Planar Graphs , booktitle =. doi:10.1007/978-1-4419-9675-6_15 , isbn =

  25. [25]

    The $B_2$ index of galled trees

    The B_2 index of galled trees. , author =. doi:10.48550/arXiv.2407.19454 , pubstate =

  26. [26]

    doi:10.1214/24-EJP1088 , issue =

    A branching process with coalescence to model random phylogenetic networks , author =. doi:10.1214/24-EJP1088 , issue =

  27. [27]

    Mathematically tractable models of random phylogenetic networks: an overview of some recent developments , shorttitle =

    Bienvenu, François , date =. Mathematically tractable models of random phylogenetic networks: an overview of some recent developments , shorttitle =. doi:10.1098/rstb.2023.0301 , file =

  28. [28]

    doi:10.1007/s00285-021-01662-7 , file =

    Revisiting Shao and Sokal's B_2 index of phylogenetic balance , author =. doi:10.1007/s00285-021-01662-7 , file =

  29. [29]

    Convergence of

    Convergence of probability measures , author =. doi:10.1002/9780470316962 , isbn =

  30. [30]

    Bingham, N. H. and Doney, R. A. , date =. Asymptotic Properties of Supercritical Branching Processes I: The Galton-Watson Process , shorttitle =. doi:10.2307/1426188 , file =

  31. [31]

    Blum, Michael G. B. and François, Olivier , date =. Which random processes describe the tree of life? A large-scale study of phylogenetic tree imbalance , shorttitle =. doi:10.1080/10635150600889625 , langid =. 16969944 , eprinttype =

  32. [32]

    The origins of the Gini index: extracts from Variabilità e Mutabilità (1912) by Corrado Gini , shorttitle =

    Ceriani, Lidia and Verme, Paolo , date =. The origins of the Gini index: extracts from Variabilità e Mutabilità (1912) by Corrado Gini , shorttitle =. doi:10.1007/s10888-011-9188-x , langid =

  33. [33]

    , namea =

    Colless, Donald H. , namea =. Review of Phylogenetics: The Theory and Practice of Phylogenetic Systematics , shorttitle =. doi:10.2307/2413420 , file =

  34. [34]

    , author =

    The asymptotic expansion of a ratio of gamma functions. , author =. doi:10.2140/pjm.1951.1.133 , file =

  35. [35]

    Tree Balance Indices: A Comprehensive Survey , shorttitle =

    Fischer, Mareike and Herbst, Lina and Kersting, Sophie and Kühn, Annemarie Luise and Wicke, Kristina , date =. Tree Balance Indices: A Comprehensive Survey , shorttitle =. doi:10.1007/978-3-031-39800-1 , isbn =

  36. [36]

    doi:10.1007/s00285-025-02205-0 , langid =

    Sackin indices for labeled and unlabeled classes of galled trees , author =. doi:10.1007/s00285-025-02205-0 , langid =

  37. [37]

    Probability: A Graduate Course , shorttitle =

    Gut, Allan , date =. Probability: A Graduate Course , shorttitle =. doi:10.1007/978-1-4614-4708-5 , isbn =

  38. [38]

    doi:10.4064/sm-70-3-231-283 , langid =

    The best constants in the Khintchine inequality , author =. doi:10.4064/sm-70-3-231-283 , langid =

  39. [39]

    Concept of structural -entropy , author =

    Quantification method of classification processes. Concept of structural -entropy , author =

  40. [40]

    doi: 10.1017/CBO9780511974076

    Huson, Daniel H. and Rupp, Regula and Scornavacca, Celine , date =. Phylogenetic Networks: Concepts, Algorithms and Applications , shorttitle =. doi:10.1017/CBO9780511974076 , isbn =

  41. [41]

    2019 , note =

    Janson, Svante , title =. 2019 , note =

  42. [42]

    2002 , pages =

    Foundations of Modern Probability , author =. doi:10.1007/978-1-4757-4015-8 , isbn =

  43. [43]

    Philosophical Transactions of the Royal Society B: Biological Sciences , volume =

    Tree balance in phylogenetic models , author =. Philosophical Transactions of the Royal Society B: Biological Sciences , volume =. 2025 , doi =

  44. [44]

    X k>τ θ2 π(k)(dπ k)2 | F π τ # + 2E

    A Limit Theorem for Multidimensional Galton-Watson Processes , author =. doi:10.1214/aoms/1177699266 , file =

  45. [45]

    doi:10.1093/sysbio/syad075 , file =

    The Limits of the Constant-rate Birth–Death Prior for Phylogenetic Tree Topology Inference , author =. doi:10.1093/sysbio/syad075 , file =

  46. [46]

    doi:10.2307/2409983 , file =

    Searching for Evolutionary Patterns in the Shape of a Phylogenetic Tree , author =. doi:10.2307/2409983 , file =

  47. [47]

    doi:10.1214/16-BJPS320 , file =

    Probabilistic models for the (sub)tree(s) of life , author =. doi:10.1214/16-BJPS320 , file =

  48. [48]

    and Olkin, Ingram and Arnold, Barry C

    Marshall, Albert W. and Olkin, Ingram and Arnold, Barry C. , editor =. Schur-Convex Functions , booktitle =. doi:10.1007/978-0-387-68276-1_3 , isbn =

  49. [49]

    doi:10.1109/tcbb.2007.1020 , file =

    Optimization Over a Class of Tree Shape Statistics , author =. doi:10.1109/tcbb.2007.1020 , file =

  50. [50]

    doi:10.1016/S0025-5564(99)00060-7 , file =

    Distributions of cherries for two models of trees , author =. doi:10.1016/S0025-5564(99)00060-7 , file =

  51. [51]

    doi:10.1086/419657 , file =

    Inferring Evolutionary Process from Phylogenetic Tree Shape , author =. doi:10.1086/419657 , file =

  52. [52]

    , date =

    Morrison, David A. , date =. Phylogenetics: The Theory and Practice of Phylogenetic Systematics, 2nd edition.—E. O. Wiley and Bruce S. Lieberman , shorttitle =. doi:10.1093/sysbio/sys065 , file =

  53. [53]

    Sur quelques applications des fonctions convexes et concaves au sens de I

    Ostrowski, Alexander , date =. Sur quelques applications des fonctions convexes et concaves au sens de I. Schur , booktitle =. doi:10.1007/978-3-0348-9342-8_29 , bookauthor =

  54. [54]

    Pavoine, Sandrine and Love, Milton and Bonsall, Michael , date =. Hierarchical partitioning of evolutionary and ecological patterns in the organization of phylogenetically-structured species assemblages: Application to rockfish (genus: Sebastes) in the Southern California Bight , shorttitle =

  55. [55]

    Towards a unifying approach to diversity measures: Bridging the gap between the Shannon entropy and Rao's quadratic index , shorttitle =

    Ricotta, Carlo and Szeidl, Laszlo , date =. Towards a unifying approach to diversity measures: Bridging the gap between the Shannon entropy and Rao's quadratic index , shorttitle =. doi:10.1016/j.tpb.2006.06.003 , file =

  56. [56]

    Vicariant Patterns and Historical Explanation in Biogeography , author =

  57. [57]

    Good” and “Bad

    “Good” and “Bad” Phenograms , author =. doi:10.1093/sysbio/21.2.225 , file =

  58. [58]

    doi:10.2307/2992186 , file =

    Tree Balance , author =. doi:10.2307/2992186 , file =

  59. [59]

    doi:10.1002/rsa.21065 , langid =

    A branching process approach to level- k phylogenetic networks , author =. doi:10.1002/rsa.21065 , langid =

  60. [60]

    Tree Balance on JSTOR , url =

  61. [61]

    Entropic nonextensivity: a possible measure of complexity ,

    Tsallis, Constantino ,. Entropic nonextensivity: a possible measure of complexity ,. doi:10.1016/S0960-0779(01)00019-4 ,

  62. [62]

    Possible generalization of Boltzmann–Gibbs statistics,

    Possible generalization of Boltzmann-Gibbs statistics , author =. doi:10.1007/BF01016429 , langid =

  63. [63]

    van der Hofstad,Random Graphs and Complex Networks

    Random Graphs and Complex Networks Volume 2 , author =. 2024 , publisher =. doi:10.1017/9781316795552 , file =

  64. [64]

    doi:10.48550/arXiv.2112.15379 , pubstate =

    The Sackin Index of Simplex Networks , author =. doi:10.48550/arXiv.2112.15379 , pubstate =