Contradiction Graphs Determine VC Dimension
Pith reviewed 2026-05-21 06:38 UTC · model grok-4.3
The pith
The contradiction graph G_m(H) determines whether a binary concept class has VC dimension at least m.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
For a class H subsets of {0,1}^X, the order-m contradiction graph G_m(H) has vertices given by the H-realizable labeled sequences of length m, with an edge between two vertices whenever the sequences assign opposite labels to some common element of X. The main result establishes that this single graph determines the predicate VCdim(H) greater than or equal to m. It follows that the full sequence of graphs (G_m(H)) for m greater than or equal to 1 determines the precise VC dimension and, in particular, whether the dimension is finite or infinite.
What carries the argument
The order-m contradiction graph G_m(H), with vertices as H-realizable labeled sequences of length m and edges when sequences disagree on the label of a shared domain point.
If this is right
- The exact numerical value of VCdim(H) is recoverable from the sequence of contradiction graphs.
- Finite versus infinite VC dimension can be decided by inspecting the graphs for increasing m.
- Any learning-theoretic bound that depends on the VC dimension can be re-expressed as a property of the corresponding contradiction graphs.
- The presence or absence of an edge in G_m(H) encodes the existence of a shattering witness or its absence.
Where Pith is reading between the lines
- The result suggests that VC dimension computation might reduce to checking connectivity or other graph invariants in these contradiction graphs.
- Similar graph constructions could be explored for related combinatorial dimensions such as the fat-shattering dimension.
- If the graphs are efficiently constructible from samples, they might yield practical tests for bounded VC dimension in concrete hypothesis classes.
Load-bearing premise
The adjacency relation defined solely by opposite labels on a shared domain point captures all distinctions needed to decide the VC-dimension threshold, without further restrictions on the domain X or additional structure on H.
What would settle it
A binary class H where G_m(H) is identical for two different values of the VC dimension threshold at some m, or where the graph fails to reflect whether m points are shattered.
Figures
read the original abstract
We study the contradiction graphs associated with binary concept classes. For a class $H \subseteq \{0,1\}^X$, the order-$m$ contradiction graph $G_m(H)$ has as vertices the $H$-realizable labeled sequences of length $m$, with two vertices adjacent when the two sequences assign opposite labels to some common domain point. Our main result is that the single graph $G_m(H)$ determines the threshold predicate $\mathrm{VCdim}(H)\ge m$. Consequently, the full sequence $(G_m(H))_{m \ge 1}$ determines the exact VC dimension and, in particular, detects finite versus infinite VC dimension, answering a question posed by Alon et al. (2024).
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript defines the order-m contradiction graph G_m(H) for a binary concept class H ⊆ {0,1}^X, with vertices given by the H-realizable labeled sequences of length m and edges present precisely when two sequences assign opposite labels to at least one shared domain point. The central result asserts that a single such graph determines the predicate VCdim(H) ≥ m; consequently the sequence (G_m(H))_{m≥1} determines the exact VC-dimension and distinguishes finite from infinite cases, answering a question of Alon et al. (2024).
Significance. If the claimed equivalence holds, the work supplies a purely graph-theoretic criterion for the VC-dimension threshold. This characterization could open new structural and algorithmic approaches to learning-theoretic questions, particularly the detection of infinite VC-dimension, and would constitute a non-trivial contribution to the theory of concept classes.
major comments (1)
- [Definition of G_m(H)] Definition of G_m(H): the vertices are defined as H-realizable labeled sequences of length m with adjacency only when opposite labels occur on a shared domain point, yet no explicit requirement is stated that the m domain points must be distinct. Standard VC-dimension is defined via shattering on m distinct points; if repeated points are permitted, realizable vertices and the induced subgraphs (e.g., cliques corresponding to full shattering) can arise from configurations with no counterpart in the distinct-point definition. The main proof must therefore either restrict the vertex set to distinct points or prove that the threshold predicate is invariant under this relaxation; otherwise the claimed determination of VCdim(H) ≥ m fails to match the classical notion.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive feedback. We address the single major comment below and will revise the manuscript to resolve the definitional ambiguity while preserving the main results.
read point-by-point responses
-
Referee: [Definition of G_m(H)] Definition of G_m(H): the vertices are defined as H-realizable labeled sequences of length m with adjacency only when opposite labels occur on a shared domain point, yet no explicit requirement is stated that the m domain points must be distinct. Standard VC-dimension is defined via shattering on m distinct points; if repeated points are permitted, realizable vertices and the induced subgraphs (e.g., cliques corresponding to full shattering) can arise from configurations with no counterpart in the distinct-point definition. The main proof must therefore either restrict the vertex set to distinct points or prove that the threshold predicate is invariant under this relaxation; otherwise the claimed determination of VCdim(H) ≥ m fails to match the classical notion.
Authors: We thank the referee for identifying this important point of precision. Our original intention was to work with sequences over arbitrary (possibly repeated) domain points, but we agree that this risks a mismatch with the standard definition of VC-dimension, which requires distinct points. In the revised manuscript we will explicitly restrict the vertex set of G_m(H) to H-realizable labeled sequences on m distinct domain points. We have verified that the main theorem continues to hold under this restriction: the forward direction (VCdim(H) ≥ m implies the appropriate subgraph in G_m(H)) relies on the existence of a shattered set of distinct points, while the converse direction uses the contradiction-graph structure on those same distinct points. The change is therefore a clarification rather than a substantive alteration of the results. revision: yes
Circularity Check
No significant circularity; graph-to-VCdim relation is a non-trivial structural claim
full rationale
The paper defines G_m(H) explicitly via H-realizable sequences of length m with adjacency on label contradictions at shared points, then asserts that this single graph encodes the predicate VCdim(H) >= m. This is presented as a derived result rather than a redefinition or tautology. No equations reduce the claimed determination to the input definitions by construction, no parameters are fitted and relabeled as predictions, and the cited prior question (Alon et al. 2024) is external. The derivation chain therefore remains self-contained against the given definitions and does not rely on load-bearing self-citations or ansatzes smuggled from prior author work. The skeptic concern about repeated versus distinct domain points addresses potential gaps in the proof's scope, not circularity.
Axiom & Free-Parameter Ledger
axioms (2)
- standard math Standard definition of VC dimension for binary concept classes over an arbitrary domain X.
- domain assumption H-realizable labeled sequences of length m are well-defined for any finite m.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanabsolute_floor_iff_bare_distinguishability unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
vertices the H-realizable labeled sequences of length m, with two vertices adjacent when the two sequences assign opposite labels to some common domain point
-
IndisputableMonolith/Foundation/AlexanderDuality.leanalexander_duality_circle_linking unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
If Gm(H) contains a cube-trace clique of size 2^m then VCdim(H) ≥ m
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Proceedings of the Thirty Seventh Conference on Learning Theory , series =
Alon, Noga and Moran, Shay and Schefler, Hilla and Yehudayoff, Amir , title =. Proceedings of the Thirty Seventh Conference on Learning Theory , series =. 2024 , editor =
work page 2024
-
[2]
Vapnik, V. N. and Chervonenkis, A. Ya. , title =. Theory of Probability and Its Applications , volume =. 1971 , doi =
work page 1971
-
[3]
Journal of Combinatorial Theory, Series A , volume =
Sauer, Norbert , title =. Journal of Combinatorial Theory, Series A , volume =. 1972 , doi =
work page 1972
-
[4]
Pacific Journal of Mathematics , volume =
Shelah, Saharon , title =. Pacific Journal of Mathematics , volume =. 1972 , doi =
work page 1972
- [5]
- [6]
- [7]
-
[8]
Littlestone, Nick , title =. Machine Learning , volume =. 1988 , doi =
work page 1988
-
[9]
Shattering, Graph Orientations, and Connectivity , journal =
Kozma, L. Shattering, Graph Orientations, and Connectivity , journal =. 2013 , doi =
work page 2013
-
[10]
Journal of Privacy and Confidentiality , volume =
Dwork, Cynthia and McSherry, Frank and Nissim, Kobbi and Smith, Adam , title =. Journal of Privacy and Confidentiality , volume =. 2017 , doi =
work page 2017
-
[11]
Advances in Cryptology -- EUROCRYPT 2006 , series =
Dwork, Cynthia and Kenthapadi, Krishnaram and McSherry, Frank and Mironov, Ilya and Naor, Moni , title =. Advances in Cryptology -- EUROCRYPT 2006 , series =. 2006 , editor =
work page 2006
-
[12]
Foundations and Trends in Theoretical Computer Science , volume =
Dwork, Cynthia and Roth, Aaron , title =. Foundations and Trends in Theoretical Computer Science , volume =. 2014 , doi =
work page 2014
-
[13]
and Nissim, Kobbi and Raskhodnikova, Sofya and Smith, Adam , title =
Kasiviswanathan, Shiva Prasad and Lee, Homin K. and Nissim, Kobbi and Raskhodnikova, Sofya and Smith, Adam , title =. SIAM Journal on Computing , volume =. 2011 , doi =
work page 2011
-
[14]
Proceedings of the 24th Annual Conference on Learning Theory , series =
Chaudhuri, Kamalika and Hsu, Daniel , title =. Proceedings of the 24th Annual Conference on Learning Theory , series =. 2011 , editor =
work page 2011
-
[15]
Blum, Avrim and Ligett, Katrina and Roth, Aaron , title =. Journal of the ACM , volume =. 2013 , doi =
work page 2013
-
[16]
Beimel, Amos and Brenner, Hai and Kasiviswanathan, Shiva Prasad and Nissim, Kobbi , title =. Machine Learning , volume =. 2014 , doi =
work page 2014
-
[17]
SIAM Journal on Computing , volume =
Feldman, Vitaly and Xiao, David , title =. SIAM Journal on Computing , volume =. 2015 , doi =
work page 2015
-
[18]
Theory of Computing , volume =
Beimel, Amos and Nissim, Kobbi and Stemmer, Uri , title =. Theory of Computing , volume =. 2016 , doi =
work page 2016
-
[19]
Journal of Machine Learning Research , volume =
Beimel, Amos and Nissim, Kobbi and Stemmer, Uri , title =. Journal of Machine Learning Research , volume =. 2019 , url =
work page 2019
-
[20]
Alon, Noga and Bun, Mark and Livni, Roi and Malliaris, Maryanthe and Moran, Shay , title =. Journal of the ACM , volume =. 2022 , doi =
work page 2022
-
[21]
Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing , series =
Ghazi, Badih and Golowich, Noah and Kumar, Ravi and Manurangsi, Pasin , title =. Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing , series =. 2021 , publisher =
work page 2021
-
[22]
Proceedings of the 35th International Conference on Algorithmic Learning Theory , series =
Bun, Mark and Cohen, Aloni and Desai, Rathin , title =. Proceedings of the 35th International Conference on Algorithmic Learning Theory , series =. 2024 , editor =
work page 2024
-
[23]
Theory of Cryptography , series =
Cynthia Dwork and Frank McSherry and Kobbi Nissim and Adam Smith , title =. Theory of Cryptography , series =. 2006 , doi =
work page 2006
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.