pith. sign in

arxiv: 2606.01092 · v1 · pith:JKQMA2A4new · submitted 2026-05-31 · 💻 cs.LG · cs.AI

A Fiber Criterion for Representation Identifiability in Supervised Learning

Pith reviewed 2026-06-28 17:40 UTC · model grok-4.3

classification 💻 cs.LG cs.AI
keywords representation identifiabilitysupervised learningfiber criterionrepresentation-head factorizationpredictor compositionaugmentation obstructionidentifiability criteria
0
0 comments X

The pith

A representation property is identifiable from the induced predictor exactly when it is constant on the fibers of the projection from representation-head pairs to the composite map.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that supervised learning constrains only the composite predictor f = c ∘ h but leaves open which factorization produced it. A property of the representation h counts as identifiable from f precisely when the property takes the same value on every pair (h, c) that composes to the same f, i.e., when the property is constant on the fibers of the projection map and therefore descends to a property of f itself. Predictor-preserving augmentation supplies the canonical counter-example: extra information can be attached to h while c simply ignores it, so the predictor stays fixed while properties such as minimality, invariance, or semantic content change. The criterion therefore separates representation-level claims from optimization or finite-sample estimation and shows that such claims always require modeling assumptions beyond the observed input-output behavior.

Core claim

For a fixed class of admissible representation-head pairs, a representation property is identifiable from the induced predictor exactly when it is constant on the fibers of the projection (h,c)↦c∘h, equivalently when it descends to a well-defined property of the predictor. Predictor-preserving augmentation gives a canonical obstruction by appending auxiliary information to a representation while the head ignores it, leaving the predictor unchanged but altering properties such as minimality, compression, invariance, equivariance, nuisance information, or semantic accessibility.

What carries the argument

The fibers of the projection map (h,c)↦c∘h, which collect all admissible factorizations that induce identical predictors.

If this is right

  • Predictor-preserving augmentation can change minimality, invariance, or semantic accessibility while the observed predictor stays identical.
  • Representation-level claims in supervised settings always require modeling assumptions, objectives, or inductive biases beyond input-output behavior.
  • Finite-sample diagnostics can exhibit different representations selected by different constraints even when supervised performance matches.
  • Identifiability statements are relative to the chosen class of admissible pairs rather than intrinsic to the data.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Any claim that a learned representation is minimal or invariant must be accompanied by an explicit statement of the admissible class of heads.
  • The fiber view suggests examining whether common regularizers in deep networks implicitly restrict the class enough to restore identifiability.
  • The same obstruction applies to claims about disentanglement or compression that rest solely on predictive performance.

Load-bearing premise

The argument requires a fixed, explicitly stated class of admissible representation-head pairs over which the fibers are taken.

What would settle it

A concrete property of representations that varies across different (h,c) pairs producing the same predictor yet remains recoverable from supervised data alone on that class would falsify the claimed equivalence.

Figures

Figures reproduced from arXiv: 2606.01092 by Vasileios Sevetlidis.

Figure 1
Figure 1. Figure 1: Information loss from factorization to prediction to scalar risk. The full representation–head pair [PITH_FULL_IMAGE:figures/full_fig_p005_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Predictor equivalence as fibers of the projection map. The projection [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Predictor-preserving augmentation. The original predictor is [PITH_FULL_IMAGE:figures/full_fig_p010_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Waterbirds representation diagnostics after supervised-performance matching. Models with comparable test [PITH_FULL_IMAGE:figures/full_fig_p016_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Supervised risk for a factorized predictor. The supervised objective evaluates the composite prediction [PITH_FULL_IMAGE:figures/full_fig_p021_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: The fiber/descent criterion. A representation property is identifiable from the induced predictor exactly when [PITH_FULL_IMAGE:figures/full_fig_p022_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Increase in auxiliary-attribute probe AUC after predictor-preserving augmentation on CelebA. The supervised [PITH_FULL_IMAGE:figures/full_fig_p026_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Probe AUC from the original representation [PITH_FULL_IMAGE:figures/full_fig_p027_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: CelebA controls. Appending an independent random coordinate or a shuffled attribute coordinate produces [PITH_FULL_IMAGE:figures/full_fig_p028_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: * CIFAR-10 [PITH_FULL_IMAGE:figures/full_fig_p030_10.png] view at source ↗
Figure 13
Figure 13. Figure 13: * CIFAR-10 [PITH_FULL_IMAGE:figures/full_fig_p031_13.png] view at source ↗
Figure 16
Figure 16. Figure 16: Constraint paths on Waterbirds. Sweeping constraint strengths changes task performance and representation [PITH_FULL_IMAGE:figures/full_fig_p034_16.png] view at source ↗
Figure 17
Figure 17. Figure 17: Holonomy versus loop radius. The pathwise diagnostic is supporting evidence. It is strongest on Colored [PITH_FULL_IMAGE:figures/full_fig_p035_17.png] view at source ↗
read the original abstract

Supervised learning evaluates predictors through their input-output behavior. When a predictor is implemented as a composition $f=c\circ h$, supervised evidence constrains the composite map $f$ but need not determine the representation-head factorization $(h,c)$. This paper formalizes the resulting representation-level identifiability problem: for a class of admissible representation-head pairs, a representation property is identifiable from the induced predictor exactly when it is constant on the fibers of the projection $(h,c)\mapsto c\circ h$, equivalently when it descends to a well-defined property of the predictor. Predictor-preserving augmentation gives a canonical obstruction: auxiliary information can be appended to a representation while the head ignores it, leaving the predictor unchanged but altering properties such as minimality, compression, invariance, equivariance, nuisance information, or semantic accessibility. This construction separates representation identifiability from optimization and finite-sample estimation. Finite-sample diagnostics illustrate, rather than prove, the criterion: exact algebraic witnesses hold the predictor fixed while changing representation diagnostics, and matched-performance Waterbirds models show that different constraints can select different representations at similar supervised performance. The results clarify that representation-level claims require assumptions, objectives, measurements, or inductive biases beyond supervised predictive behavior alone.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper claims that, for any fixed class of admissible representation-head pairs (h, c), a representation property is identifiable from the induced predictor f = c ∘ h exactly when the property is constant on the fibers of the projection (h, c) ↦ f (equivalently, when it descends to a well-defined property of the predictor). It introduces predictor-preserving augmentation as a canonical obstruction that alters representation properties while leaving f unchanged, and uses finite-sample diagnostics (algebraic witnesses and matched-performance Waterbirds models) to illustrate that supervised evidence alone does not determine representation-level properties without further assumptions.

Significance. If the formalization holds, the fiber criterion supplies a precise, relative-to-the-class language for the well-known gap between predictor behavior and representation properties, cleanly separating identifiability questions from optimization and finite-sample estimation. The explicit framing that supervised evidence is insufficient without additional biases or measurements is consistent with the definitional result and may help organize future work on representation learning.

major comments (2)
  1. [fiber criterion definition (abstract and § on identifiability problem)] The central equivalence (property identifiable iff constant on fibers) is presented as a formal criterion but follows immediately from the definition of identifiability relative to the chosen class of pairs; the manuscript does not derive additional consequences or bounds that would make the statement non-tautological. This does not invalidate the framing but limits the load-bearing technical contribution to the augmentation construction and examples.
  2. [framework setup and conclusion] The admissible class of (h, c) pairs is treated as fixed and given, yet the paper provides no general procedure or verification method for constructing or validating this class from data or domain knowledge; without such a procedure the identifiability statement remains relative to an arbitrary modeling choice, as noted in the weakest-assumption discussion.
minor comments (2)
  1. [finite-sample examples] The finite-sample diagnostics (Waterbirds and algebraic witnesses) are described as illustrative rather than exhaustive; adding a brief statement on the scope of the enumeration performed would clarify their role.
  2. [main formal statement] Notation for the projection map and fibers is introduced clearly in the abstract but could be repeated with an explicit equation number in the main text for easier reference.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and for recommending minor revision. The comments correctly identify that the central equivalence is definitional and that the admissible class is a modeling choice; we address each point below and indicate where revisions will be made.

read point-by-point responses
  1. Referee: [fiber criterion definition (abstract and § on identifiability problem)] The central equivalence (property identifiable iff constant on fibers) is presented as a formal criterion but follows immediately from the definition of identifiability relative to the chosen class of pairs; the manuscript does not derive additional consequences or bounds that would make the statement non-tautological. This does not invalidate the framing but limits the load-bearing technical contribution to the augmentation construction and examples.

    Authors: We agree that the equivalence follows directly from the definition of identifiability relative to a fixed class. The manuscript frames the statement as a criterion to supply a precise, relative-to-the-class test that can be applied to concrete properties (minimality, invariance, etc.). The load-bearing technical element is indeed the predictor-preserving augmentation construction, which supplies a canonical obstruction and separates representation identifiability from optimization and estimation questions. We will add a short clarifying sentence in the introduction to make the definitional character explicit while preserving the criterion language for its diagnostic utility. revision: partial

  2. Referee: [framework setup and conclusion] The admissible class of (h, c) pairs is treated as fixed and given, yet the paper provides no general procedure or verification method for constructing or validating this class from data or domain knowledge; without such a procedure the identifiability statement remains relative to an arbitrary modeling choice, as noted in the weakest-assumption discussion.

    Authors: The admissible class is deliberately treated as a fixed modeling primitive that encodes domain assumptions about admissible factorizations. The weakest-assumption discussion already states that identifiability claims are always relative to this choice and that no data-driven procedure can validate the class without introducing further assumptions. Because the class is part of the modeling setup rather than an object to be inferred, the paper does not supply a general construction method; this relativity is intentional and is not presented as a shortcoming. revision: no

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The paper formalizes representation identifiability relative to a fixed class of admissible (h,c) pairs as the property being constant on fibers of the projection (h,c) ↦ c∘h. This equivalence is definitional by construction of the identifiability notion itself and does not reduce any derived claim to fitted parameters, self-citations, or smuggled ansatzes. The predictor-preserving augmentation and finite-sample examples (Waterbirds, algebraic witnesses) illustrate the criterion without circular reduction. The framework is self-contained, explicitly conditioning results on the modeling choice of class rather than claiming intrinsic uniqueness or external derivation.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on the existence of a well-defined class of admissible (h,c) pairs and on the mathematical fact that properties constant on fibers descend to the quotient; these are standard set-theoretic notions rather than new postulates.

axioms (2)
  • domain assumption There exists a class of admissible representation-head pairs over which the projection map is defined.
    The identifiability statement is relative to this class; the abstract invokes it when defining the fibers.
  • standard math Properties of representations that are constant on fibers descend to well-defined properties of the predictor.
    This is the standard quotient construction used to define the criterion.

pith-pipeline@v0.9.1-grok · 5738 in / 1411 out tokens · 21903 ms · 2026-06-28T17:40:19.823588+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

53 extracted references · 5 canonical work pages · 1 internal anchor

  1. [1]

    On the mathematical foundations of theoretical statistics,

    On the Mathematical Foundations of Theoretical Statistics , author =. Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character , volume =. 1922 , publisher =. doi:10.1098/rsta.1922.0009 , url =

  2. [2]

    IEEE transactions on pattern analysis and machine intelligence , volume=

    Representation learning: A review and new perspectives , author=. IEEE transactions on pattern analysis and machine intelligence , volume=. 2013 , publisher=

  3. [3]

    arXiv preprint physics/0004057 , year=

    The information bottleneck method , author=. arXiv preprint physics/0004057 , year=

  4. [4]

    International Conference on Learning Representations , year=

    Deep Variational Information Bottleneck , author=. International Conference on Learning Representations , year=

  5. [5]

    Journal of Machine Learning Research , volume=

    Emergence of Invariance and Disentanglement in Deep Representations , author=. Journal of Machine Learning Research , volume=. 2018 , url=

  6. [6]

    IEEE transactions on pattern analysis and machine intelligence , volume=

    Information dropout: Learning optimal representations through noisy computation , author=. IEEE transactions on pattern analysis and machine intelligence , volume=. 2018 , publisher=

  7. [7]

    Domain-Adversarial Training of Neural Networks , journal =

    Yaroslav Ganin and Evgeniya Ustinova and Hana Ajakan and Pascal Germain and Hugo Larochelle and Fran. Domain-Adversarial Training of Neural Networks , journal =. 2016 , volume =

  8. [8]

    2020 , eprint=

    Invariant Risk Minimization , author=. 2020 , eprint=

  9. [9]

    Proceedings of the 37th International Conference on Machine Learning , pages =

    Invariant Risk Minimization Games , author =. Proceedings of the 37th International Conference on Machine Learning , pages =. 2020 , editor =

  10. [10]

    Proceedings of the 38th International Conference on Machine Learning , pages =

    Out-of-Distribution Generalization via Risk Extrapolation (REx) , author =. Proceedings of the 38th International Conference on Machine Learning , pages =. 2021 , editor =

  11. [11]

    Proceedings of The 24th International Conference on Artificial Intelligence and Statistics , pages =

    Does Invariant Risk Minimization Capture Invariance? , author =. Proceedings of The 24th International Conference on Artificial Intelligence and Statistics , pages =. 2021 , editor =

  12. [12]

    Proceedings of The 33rd International Conference on Machine Learning , pages =

    Group Equivariant Convolutional Networks , author =. Proceedings of The 33rd International Conference on Machine Learning , pages =. 2016 , editor =

  13. [13]

    arXiv preprint arXiv:2104.13478 , year=

    Geometric deep learning: Grids, groups, graphs, geodesics, and gauges , author=. arXiv preprint arXiv:2104.13478 , year=

  14. [14]

    Proceedings of the 37th International Conference on Machine Learning , pages=

    A Simple Framework for Contrastive Learning of Visual Representations , author=. Proceedings of the 37th International Conference on Machine Learning , pages=. 2020 , series=

  15. [15]

    Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

    Momentum contrast for unsupervised visual representation learning , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

  16. [16]

    Advances in neural information processing systems , volume=

    Bootstrap your own latent-a new approach to self-supervised learning , author=. Advances in neural information processing systems , volume=

  17. [17]

    Irina Higgins and Loic Matthey and Arka Pal and Christopher Burgess and Xavier Glorot and Matthew Botvinick and Shakir Mohamed and Alexander Lerchner , booktitle=. beta-. 2017 , url=

  18. [18]

    Proceedings of the 35th International Conference on Machine Learning , pages =

    Disentangling by Factorising , author =. Proceedings of the 35th International Conference on Machine Learning , pages =. 2018 , editor =

  19. [19]

    Proceedings of the 36th International Conference on Machine Learning , pages =

    Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations , author =. Proceedings of the 36th International Conference on Machine Learning , pages =. 2019 , editor =

  20. [20]

    Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics , pages =

    Nonlinear ICA Using Auxiliary Variables and Generalized Contrastive Learning , author =. Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics , pages =. 2019 , editor =

  21. [21]

    Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics , pages =

    Variational Autoencoders and Nonlinear ICA: A Unifying Framework , author =. Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics , pages =. 2020 , editor =

  22. [22]

    2017 , url=

    Understanding intermediate layers using linear classifier probes , author=. 2017 , url=

  23. [23]

    Hoffmann, J., Borgeaud, S., Mensch, A., Buchatskaya, E., Cai, T., Rutherford, E., Casas, D

    Hewitt, John and Liang, Percy. Designing and Interpreting Probes with Control Tasks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 2019. doi:10.18653/v1/D19-1275

  24. [24]

    Probing Classifiers: Promises, Shortcomings, and Advances

    Belinkov, Yonatan. Probing Classifiers: Promises, Shortcomings, and Advances. Computational Linguistics. 2022. doi:10.1162/coli_a_00422

  25. [25]

    Proceedings of the AAAI Conference on Artificial Intelligence , volume=

    Invariant information bottleneck for domain generalization , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

  26. [26]

    Proceedings of the IEEE , volume=

    Toward causal representation learning , author=. Proceedings of the IEEE , volume=. 2021 , publisher=

  27. [27]

    Advances in neural information processing systems , volume=

    Svcca: Singular vector canonical correlation analysis for deep learning dynamics and interpretability , author=. Advances in neural information processing systems , volume=

  28. [28]

    International conference on machine learning , pages=

    Similarity of neural network representations revisited , author=. International conference on machine learning , pages=. 2019 , organization=

  29. [29]

    Nature Machine Intelligence , volume=

    Shortcut learning in deep neural networks , author=. Nature Machine Intelligence , volume=. 2020 , publisher=

  30. [30]

    International Conference on Learning Representations , year=

    Distributionally Robust Neural Networks , author=. International Conference on Learning Representations , year=

  31. [31]

    Proceedings of the 38th International Conference on Machine Learning , pages =

    On Linear Identifiability of Learned Representations , author =. Proceedings of the 38th International Conference on Machine Learning , pages =. 2021 , editor =

  32. [32]

    ACM Computing Surveys , volume=

    Similarity of neural network models: A survey of functional and representational measures , author=. ACM Computing Surveys , volume=. 2025 , publisher=

  33. [33]

    2015 ieee information theory workshop (itw) , pages=

    Deep learning and the information bottleneck principle , author=. 2015 ieee information theory workshop (itw) , pages=. 2015 , organization=

  34. [34]

    International conference on machine learning , pages=

    Disentangling by factorising , author=. International conference on machine learning , pages=. 2018 , organization=

  35. [35]

    arXiv preprint arXiv:1611.03530 , year=

    Understanding deep learning requires rethinking generalization , author=. arXiv preprint arXiv:1611.03530 , year=

  36. [36]

    2013 , publisher=

    Statistical decision theory and Bayesian analysis , author=. 2013 , publisher=

  37. [37]

    2008 , publisher=

    Statistical Decision Theory: Estimation, Testing, and Selection , author=. 2008 , publisher=

  38. [38]

    Journal of the Italian Statistical Society , volume=

    On identifiability of parametric statistical models , author=. Journal of the Italian Statistical Society , volume=. 1994 , publisher=

  39. [39]

    Model Identifiability , booktitle =

    Huang, Guan-Hua , publisher =. Model Identifiability , booktitle =. doi:https://doi.org/10.1002/9781118445112.stat06411.pub2 , url =. https://onlinelibrary.wiley.com/doi/pdf/10.1002/9781118445112.stat06411.pub2 , year =

  40. [40]

    arXiv preprint arXiv:2002.06041 , year=

    A general theory of identification , author=. arXiv preprint arXiv:2002.06041 , year=

  41. [41]

    Journal of Statistical Mechanics: Theory and Experiment , volume=

    On the information bottleneck theory of deep learning , author=. Journal of Statistical Mechanics: Theory and Experiment , volume=. 2019 , publisher=

  42. [42]

    Causal Inference by using Invariant Prediction: Identification and Confidence Intervals , journal =

    Peters, Jonas and B. Causal Inference by using Invariant Prediction: Identification and Confidence Intervals , journal =. 2016 , month =. doi:10.1111/rssb.12167 , url =

  43. [43]

    1998 , publisher=

    Theory of point estimation , author=. 1998 , publisher=

  44. [44]

    Regression Graphics: Ideas for Studying Regressions through Graphics , author=

  45. [45]

    International Conference on Machine Learning , pages=

    Train Faster, Generalize Better: Stability of Stochastic Gradient Descent , author=. International Conference on Machine Learning , pages=

  46. [46]

    Advances in neural information processing systems , volume=

    Exploring generalization in deep learning , author=. Advances in neural information processing systems , volume=

  47. [47]

    Journal of Machine Learning Research , volume=

    The Implicit Bias of Gradient Descent on Separable Data , author=. Journal of Machine Learning Research , volume=

  48. [48]

    The Fourteenth International Conference on Learning Representations , year=

    Gauge-invariant representation holonomy , author=. The Fourteenth International Conference on Learning Representations , year=

  49. [49]

    2026 , eprint=

    Training Memory in Deep Neural Networks: Mechanisms, Evidence, and Measurement Gaps , author=. 2026 , eprint=

  50. [50]

    The Fourteenth International Conference on Learning Representations , year =

    Statistical and Structural Identifiability in Representation Learning , author =. The Fourteenth International Conference on Learning Representations , year =

  51. [51]

    Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing , pages =

    Conditional Probing: Measuring Usable Information Beyond a Baseline , author =. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing , pages =. 2021 , publisher =

  52. [52]

    Advances in Neural Information Processing Systems , volume =

    Improving Self-Supervised Learning by Characterizing Idealized Representations , author =. Advances in Neural Information Processing Systems , volume =. 2022 , url =

  53. [53]

    Journal of Machine Learning Research , volume =

    Underspecification Presents Challenges for Credibility in Modern Machine Learning , author =. Journal of Machine Learning Research , volume =. 2022 , url =