pith. sign in

arxiv: 1807.07547 · v3 · pith:NA3FKEKUnew · submitted 2018-07-19 · 🧮 math.ST · cs.LG· stat.TH

Partial recovery bounds for clustering with the relaxed Kmeans

classification 🧮 math.ST cs.LGstat.TH
keywords meansrelaxedpartialprobabilitiesrecoveryresultssettingbounds
0
0 comments X
read the original abstract

We investigate the clustering performances of the relaxed $K$means in the setting of sub-Gaussian Mixture Model (sGMM) and Stochastic Block Model (SBM). After identifying the appropriate signal-to-noise ratio (SNR), we prove that the misclassification error decay exponentially fast with respect to this SNR. These partial recovery bounds for the relaxed $K$means improve upon results currently known in the sGMM setting. In the SBM setting, applying the relaxed $K$means SDP allows to handle general connection probabilities whereas other SDPs investigated in the literature are restricted to the assortative case (where within group probabilities are larger than between group probabilities). Again, this partial recovery bound complements the state-of-the-art results. All together, these results put forward the versatility of the relaxed $K$means.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Davis-Kahan Theorem in the two-to-infinity norm and its application to perfect clustering

    stat.ME 2024-11 unverdicted novelty 5.0

    Develops a toolbox for two-to-infinity norm bounds on eigenvector deviations under multiple assumption sets and derives generic conditions for perfect clustering