pith. sign in

arxiv: 1507.08685 · v1 · pith:NUAWJMA2new · submitted 2015-07-30 · 💻 cs.IT · cond-mat.stat-mech· math.IT· math.ST· stat.TH

Asymptotic Mutual Information for the Two-Groups Stochastic Block Model

classification 💻 cs.IT cond-mat.stat-mechmath.ITmath.STstat.TH
keywords informationmodelmutuallabelsper-vertexrandomvertexbetter
0
0 comments X
read the original abstract

We develop an information-theoretic view of the stochastic block model, a popular statistical model for the large-scale structure of complex networks. A graph $G$ from such a model is generated by first assigning vertex labels at random from a finite alphabet, and then connecting vertices with edge probabilities depending on the labels of the endpoints. In the case of the symmetric two-group model, we establish an explicit `single-letter' characterization of the per-vertex mutual information between the vertex labels and the graph. The explicit expression of the mutual information is intimately related to estimation-theoretic quantities, and --in particular-- reveals a phase transition at the critical point for community detection. Below the critical point the per-vertex mutual information is asymptotically the same as if edges were independent. Correspondingly, no algorithm can estimate the partition better than random guessing. Conversely, above the threshold, the per-vertex mutual information is strictly smaller than the independent-edges upper bound. In this regime there exists a procedure that estimates the vertex labels better than random guessing.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Notes on Computational Hardness of Hypothesis Testing: Predictions using the Low-Degree Likelihood Ratio

    math.ST 2019-07 unverdicted novelty 6.0

    The low-degree likelihood ratio method predicts computational hardness of hypothesis testing problems, with new connections to spectral methods and a lower bound for tensor PCA.