pith. sign in

arxiv: 2606.18197 · v1 · pith:4NULKBNZnew · submitted 2026-06-16 · 📊 stat.AP · stat.ME

A Sensitivity Framework for Identifying Contagion under Latent Homophily for Fixed-in-Time Network Analyses, with an Application to U.S. House Congressional Voting

Pith reviewed 2026-06-26 21:38 UTC · model grok-4.3

classification 📊 stat.AP stat.ME
keywords contagionlatent homophilysensitivity analysisnetwork datacausal inferenceselection biascongressional votingrisk ratio
0
0 comments X

The pith

The gap between observed network contagion and true controlled direct effect is set by how strongly latent homophily changes which dyads connect.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper reframes the classic contagion-versus-homophily problem as a selection-bias question in the simple case of one fixed network observed at two time points with binary nodal outcomes. It defines a controlled direct effect that holds a tie fixed while intervening on an alter’s outcome, then shows that the difference between this effect and the raw connected-dyad risk ratio is fully determined by the strength of a latent homophily factor that shifts the composition of connected pairs. Nonparametric bounds on that strength are derived so that the original question “is there contagion?” becomes the concrete question “how large would latent homophily have to be to explain the entire observed association away?” A simulation study and an application to 2008 House votes on TARP illustrate when the bounds support or rule out contagion.

Core claim

We show that the gap between the CDE and the observed connected-dyad risk ratio is governed by how strongly a latent homophily variable shifts the composition of connected dyads. Inspired by Smith-style selection-bias sensitivity analysis and the risk-ratio bounding function of Ding and VanderWeele we develop interpretable nonparametric bounds. This translates the question 'is there contagion?' into the question 'how strong would latent homophily have to be to explain away the observed contagion?'

What carries the argument

A single sensitivity parameter that quantifies how strongly an unobserved homophily factor alters the composition of connected dyads, used to bound the difference between the controlled direct effect and the raw observed risk ratio.

If this is right

  • Under the derived bounds, contagion can be declared plausible once the homophily strength needed to explain the data away exceeds any value judged credible for that population.
  • The framework applies directly to any fixed network with two waves of binary nodal outcomes without requiring a parametric model of tie formation.
  • Simulation results show the bounds maintain nominal error control while retaining power to detect contagion when homophily is weak.
  • In the TARP vote data the method identifies the precise homophily threshold at which contagion remains credible.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same selection-bias reframing could be applied to directed or weighted networks once an appropriate dyad-composition parameter is defined.
  • Bounds derived here might be combined with measured covariates to tighten the required homophily threshold in empirical applications.
  • The approach suggests a design for future network studies: collect auxiliary data on potential homophily factors precisely to calibrate the sensitivity parameter.

Load-bearing premise

The only source of bias between the controlled direct effect and the observed association is latent homophily whose strength is captured by the sensitivity parameter.

What would settle it

Collect direct measures or proxies of the latent homophily variable in the same network setting and test whether its observed strength exceeds the value required by the bounds to nullify the contagion claim.

Figures

Figures reproduced from arXiv: 2606.18197 by Duncan A. Clark.

Figure 1
Figure 1. Figure 1: DAG showing latent U variables, with contagion and latent homophily for a single dyad; observed fixed covariates X are suppressed. We consider data generated from [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 4
Figure 4. Figure 4: Substantive sensitivity scenarios for CDE lower bounds among switchable egos ( [PITH_FULL_IMAGE:figures/full_fig_p023_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Naturally connected CDE simulation heatmaps for no latent effect on outcome ( [PITH_FULL_IMAGE:figures/full_fig_p029_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Forced-contact CDE simulation heatmaps for no latent effect on outcome ( [PITH_FULL_IMAGE:figures/full_fig_p029_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Naturally connected CDE simulation heatmaps for strong latent effect on outcome ( [PITH_FULL_IMAGE:figures/full_fig_p029_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Forced-contact CDE simulation heatmaps for strong latent effect on outcome ( [PITH_FULL_IMAGE:figures/full_fig_p030_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Naturally connected CDE truth, Type I error, and Type II error heatmaps for no latent [PITH_FULL_IMAGE:figures/full_fig_p031_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Forced-contact CDE truth, Type I error, and Type II error heatmaps for no latent effect [PITH_FULL_IMAGE:figures/full_fig_p032_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Naturally connected CDE truth, Type I error, and Type II error heatmaps for moderate [PITH_FULL_IMAGE:figures/full_fig_p032_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Forced-contact CDE truth, Type I error, and Type II error heatmaps for moderate [PITH_FULL_IMAGE:figures/full_fig_p033_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Naturally connected CDE truth, Type I error, and Type II error heatmaps for strong [PITH_FULL_IMAGE:figures/full_fig_p033_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: Forced-contact CDE truth, Type I error, and Type II error heatmaps for strong latent [PITH_FULL_IMAGE:figures/full_fig_p034_14.png] view at source ↗
Figure 15
Figure 15. Figure 15: Degree distribution for the primary clean cosponsorship network. Ties are signed [PITH_FULL_IMAGE:figures/full_fig_p035_15.png] view at source ↗
Figure 16
Figure 16. Figure 16: Empirical connected risk ratio, observed- [PITH_FULL_IMAGE:figures/full_fig_p037_16.png] view at source ↗
read the original abstract

Whether connected units are similar because influence spreads across ties or because similar units form ties, is a long-standing problem. Contagion or influence is generically unidentified from observational network data. We consider the minimal and common setting of a single network, fixed over time, with two waves of a binary nodal outcome. Rather than positing a parametric model for network formation, we reframe identification of contagion as a selection-bias problem and develop a sensitivity framework. We define a controlled direct effect (CDE) holding a tie present while intervening on an alter's outcome. We show that the gap between the CDE and the observed connected-dyad risk ratio is governed by how strongly a latent homophily variable shifts the composition of connected dyads. Inspired by Smith-style selection-bias sensitivity analysis and the risk-ratio bounding function of Ding and VanderWeele we develop interpretable nonparametric bounds. This translates the question "is there contagion?" into the question "how strong would latent homophily have to be to explain away the observed contagion?" A simulation study characterizes the bounds' error control and power. We apply the framework to the 2008 U.S. House votes on the Troubled Asset Relief Program, identifying under which assumptions contagion is plausible.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 2 minor

Summary. The paper develops a sensitivity framework for contagion identification in a minimal setting of one fixed network observed at two time points with binary nodal outcomes. It reframes the problem as selection bias due to latent homophily, defines a controlled direct effect (CDE) holding ties fixed while intervening on alter outcomes, and derives nonparametric bounds (inspired by Smith-style selection-bias analysis and Ding-VanderWeele risk-ratio bounds) on the gap between the CDE and the observed connected-dyad risk ratio. The central translation is that assessing contagion reduces to assessing how strong latent homophily must be to explain away the observed association. The framework is evaluated in a simulation study on error control and power and applied to 2008 U.S. House TARP votes.

Significance. If the bounds derivation holds, the work supplies a practical, nonparametric sensitivity tool for distinguishing contagion from latent homophily without parametric network-formation models. Strengths include the explicit single sensitivity parameter, the simulation characterizing bound behavior, and the empirical demonstration; these make the contribution concrete and falsifiable in applied settings. The approach extends established selection-bias techniques to network contagion questions in a way that could see routine use in social-science network analyses.

minor comments (2)
  1. [Simulation study] The simulation section would benefit from an explicit table or figure reporting numerical power and coverage rates across the range of homophily strengths examined, to allow direct verification of the claimed error-control properties.
  2. [Methods] Notation for the bounding function and the latent-homophily sensitivity parameter should be introduced with a single consolidated definition early in the methods section rather than piecemeal.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their accurate summary of the manuscript and for recognizing its potential contribution. The recommendation for minor revision is noted. No specific major comments appear in the report, so there are no individual points requiring rebuttal or revision at this stage. We will incorporate any editorial or minor changes in the revised version.

Circularity Check

0 steps flagged

No significant circularity; derivation is self-contained sensitivity analysis

full rationale

The paper reframes contagion identification as a selection-bias problem under a fixed network with two binary outcome waves, defines the CDE, and derives nonparametric bounds on the gap to the observed risk ratio using the strength of latent homophily as the sensitivity parameter. This directly follows established external techniques (Smith-style selection-bias analysis and Ding-VanderWeele risk-ratio bounds) without any reduction of the target quantity to a fitted parameter defined by the same equations, without self-citation load-bearing on the central claim, and without ansatz smuggling or renaming. The result is explicitly a sensitivity framework (not point identification), with the single parameter acknowledged as the modeling choice; the logic remains internally consistent and independent of the paper's own inputs.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

The central claim rests on the definition of the controlled direct effect, the fixed-network two-wave setting, and the single sensitivity parameter that governs the gap between the CDE and the observed risk ratio.

free parameters (1)
  • latent homophily sensitivity parameter
    The nonparametric bounds are expressed as functions of this parameter that quantifies the shift in connected-dyad composition due to unobserved homophily.
axioms (2)
  • domain assumption Single fixed network with two waves of binary nodal outcome
    Described as the minimal and common setting under consideration.
  • domain assumption No parametric model for network formation is posited
    The problem is instead reframed entirely as a selection-bias problem.

pith-pipeline@v0.9.1-grok · 5763 in / 1500 out tokens · 29921 ms · 2026-06-26T21:38:47.899968+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

42 extracted references · 16 canonical work pages

  1. [1]

    Thomas , title =

    Cosma Rohilla Shalizi and Andrew C. Thomas , title =. Sociological Methods & Research , volume =. 2011 , doi =

  2. [2]

    Political Analysis , author=

    Contagion, Confounding, and Causality: Confronting the Three C’s of Observational Political Networks Research , volume=. Political Analysis , author=. 2023 , pages=. doi:10.1017/pan.2022.35 , number=

  3. [3]

    Journal of the Royal Statistical Society Series A: Statistics in Society , volume =

    Clark, Duncan A and Handcock, Mark S , title =. Journal of the Royal Statistical Society Series A: Statistics in Society , volume =. 2024 , month =. doi:10.1093/jrsssa/qnae001 , url =

  4. [4]

    Graphical and Recursive Models for Contingency Tables , volume =

    Wermuth, Nanny and Lauritzen, Steffen , year =. Graphical and Recursive Models for Contingency Tables , volume =. Biometrika , doi =

  5. [5]

    2001 , month =

    Spirtes, Peter and Glymour, Clark and Scheines, Richard , title =. 2001 , month =. doi:10.7551/mitpress/1754.001.0001 , url =

  6. [6]

    Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics , pages =

    Statistical Tests for Contagion in Observational Social Network Studies , author =. Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics , pages =. 2013 , editor =

  7. [7]

    Sharp sensitivity bounds for mediation under unmeasured mediator-outcome confounding , volume =

    Ding, Peng and VanderWeele, Tyler , year =. Sharp sensitivity bounds for mediation under unmeasured mediator-outcome confounding , volume =. Biometrika , doi =

  8. [8]

    Smith and Tyler J

    Louisa H. Smith and Tyler J. VanderWeele , title =. Epidemiology , volume =. 2019 , doi =

  9. [9]

    2009 , isbn =

    Pearl, Judea , title =. 2009 , isbn =

  10. [10]

    Probabilistic reasoning in intelligent systems: networks of plausible inference , url =

    Pearl, Judea , biburl =. Probabilistic reasoning in intelligent systems: networks of plausible inference , url =

  11. [11]

    and Arah, Onyebuchi A

    VanderWeele, Tyler J. and Arah, Onyebuchi A. , title =. Epidemiology , volume =. 2011 , doi =

  12. [12]

    and Robins, James M

    Hernan, Miguel A. and Robins, James M. , title =. 2020 , publisher =

  13. [13]

    1995 , url=

    Structure and influence : statistical models for the dynamics of actor attributes, network structure, and their interdependence , author=. 1995 , url=

  14. [14]

    and Fowler, James H

    Christakis, Nicholas A. and Fowler, James H. , title =. New England Journal of Medicine , volume =. 2007 , month =. doi:10.1056/NEJMsa066082 , pmid =

  15. [15]

    Anish Agarwal and Rahul Singh

    McFowland, III, Edward and Shalizi, Cosma Rohilla , title =. Journal of the American Statistical Association , volume =. 2023 , publisher =. doi:10.1080/01621459.2021.1953506 , URL =

  16. [16]

    Roll Call Vote XML Files , year =

  17. [17]

    United States Congress Legislative Data , year =

  18. [18]

    and Haenszel, W

    Cornfield, J. and Haenszel, W. and Hammond, E. C. and Lilienfeld, A. M. and Shimkin, M. B. and Wynder, E. L. , title =. Journal of the National Cancer Institute , year =

  19. [19]

    , title =

    Bross, Irwin D. , title =. Journal of Chronic Diseases , year =. doi:10.1016/0021-9681(66)90062-2 , pmid =

  20. [20]

    Journal of the Royal Statistical Society Series B: Statistical Methodology , volume =

    Cinelli, Carlos and Hazlett, Chad , title =. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume =. 2020 , month =. doi:10.1111/rssb.12348 , url =

  21. [21]

    and Ding, Peng , title =

    VanderWeele, Tyler J. and Ding, Peng , title =. Annals of Internal Medicine , year =. doi:10.7326/M16-2607 , pmid =

  22. [22]

    Proceedings of the National Academy of Sciences , volume =

    Sinan Aral and Lev Muchnik and Arun Sundararajan , title =. Proceedings of the National Academy of Sciences , volume =. 2009 , doi =

  23. [23]

    , title =

    Butts, Carter T. , title =. Sociological Methodology , volume =. 2008 , doi =

  24. [24]

    Disentangling social contagion from prior similarity in time-ordered behaviour sequences , journal =

    Leifeld, Philip and Mart. Disentangling social contagion from prior similarity in time-ordered behaviour sequences , journal =. 2026 , note =. doi:10.1093/jrsssa/qnag069 , url =

  25. [25]

    , title =

    Artico, Igor and Wit, Ernst C. , title =. Journal of the Royal Statistical Society Series A: Statistics in Society , volume =. 2023 , doi =

  26. [26]

    Steglich, Christian and Snijders, Tom A. B. and Pearson, Michael , title =. Sociological Methodology , volume =. doi:https://doi.org/10.1111/j.1467-9531.2010.01225.x , url =. https://onlinelibrary.wiley.com/doi/pdf/10.1111/j.1467-9531.2010.01225.x , abstract =

  27. [27]

    and Handcock, Mark S

    Krivitsky, Pavel N. and Handcock, Mark S. , title =. Journal of the Royal Statistical Society: Series B (Statistical Methodology) , volume =. 2014 , doi =

  28. [28]

    British Journal of Management , volume =

    Cai, Wanxiang and Polzin, Friedemann , title =. British Journal of Management , volume =. doi:https://doi.org/10.1111/1467-8551.12917 , url =. https://onlinelibrary.wiley.com/doi/pdf/10.1111/1467-8551.12917 , abstract =

  29. [29]

    Epidemic processes in complex networks , author =. Rev. Mod. Phys. , volume =. 2015 , month =. doi:10.1103/RevModPhys.87.925 , url =

  30. [30]

    Kudos make you run! How runners influence each other on the online social network Strava , journal =

    Rob Franken and Hidde Bekhuis and Jochem Tolsma , keywords =. Kudos make you run! How runners influence each other on the online social network Strava , journal =. 2023 , issn =. doi:https://doi.org/10.1016/j.socnet.2022.10.001 , url =

  31. [31]

    and Foster, Mark D

    Couch, Jim F. and Foster, Mark D. and Malone, Keith and Black, David L. , title =. Cato Journal , year =

  32. [32]

    The Washington Post , year =

    Montgomery, Lori and Cho, David , title =. The Washington Post , year =

  33. [33]

    American Economic Review , Volume =

    Mian, Atif and Sufi, Amir and Trebbi, Francesco , Title =. American Economic Review , Volume =. 2010 , Month =. doi:10.1257/aer.100.5.1967 , URL =

  34. [34]

    , title =

    Ramirez, Carlos D. , title =. Review of Law & Economics , year =

  35. [35]

    Review of Finance , year =

    Tahoun, Ahmed and van Lent, Laurence , title =. Review of Finance , year =. doi:10.1093/rof/rfy015 , url =

  36. [36]

    http://www.nber.org/papers/w16437

    Cohen, Lauren and Malloy, Christopher. Friends in High Places. 2010. doi:10.3386/w16437 , URL = "http://www.nber.org/papers/w16437", abstract =

  37. [37]

    , title =

    Fowler, James H. , title =. Political Analysis , volume =. 2006 , doi =

  38. [38]

    , title =

    Kirkland, Justin H. , title =. The Journal of Politics , volume =. 2011 , doi =

  39. [39]

    and Halloran, M

    Hudgens, Michael G. and Halloran, M. Elizabeth , title =. Journal of the American Statistical Association , volume =. 2008 , doi =

  40. [40]

    and VanderWeele, Tyler J

    Ogburn, Elizabeth L. and VanderWeele, Tyler J. , title =. Statistical Science , volume =. 2014 , doi =

  41. [41]

    and Samii, Cyrus , title =

    Aronow, Peter M. and Samii, Cyrus , title =. The Annals of Applied Statistics , volume =. 2017 , doi =

  42. [42]

    Average Treatment Effects in the Presence of Unknown Interference , journal =

    S. Average Treatment Effects in the Presence of Unknown Interference , journal =. 2021 , doi =