Most ReLU Networks Admit Identifiable Parameters

· 2026 · cs.LG · arXiv 2605.03601

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open full Pith review browse 3 citing papers arXiv PDF

abstract

We study the realization map of deep ReLU networks, focusing on when a function determines its parameters up to scaling and permutation. To analyze hidden redundancies beyond these standard symmetries, we introduce a framework based on weighted polyhedral complexes. Our main result shows that for every architecture whose input and hidden layers have width at least two, there exists an open set of identifiable parameters. This implies that the functional dimension of every such architecture is exactly the number of parameters minus the number of hidden neurons. We further show that minimal functional representations can still have non-trivial parameter redundancies. Finally, we establish a generic depth hierarchy, whereby for an open set of parameters the realized function cannot be represented generically by any shallower network.

representative citing papers

Conservation Laws from Data Symmetry in Neural Networks

cs.LG · 2026-06-09 · unverdicted · novelty 7.0

Data symmetries generically do not induce conserved quantities in NN training for analytic non-polynomial losses, but can for MSE with tensorizable networks.

On the fibers and semi-algebraicity of ReLU neuromanifolds

math.AG · 2026-06-01 · unverdicted · novelty 7.0

ReLU neuromanifolds are not semi-algebraic quotients of weight spaces; honest opens are conjectured semi-algebraic and proven Zariski in the shallow case.

Singular Learning and Occam's Razor in Deep Monomial Networks

cs.LG · 2026-06-26 · unverdicted · novelty 5.0

For large monomial activation degree, critical points in deep fully-connected networks coincide exactly with subnetwork configurations where neurons are inactive or redundant.

citing papers explorer

Showing 3 of 3 citing papers.

Conservation Laws from Data Symmetry in Neural Networks cs.LG · 2026-06-09 · unverdicted · none · ref 35 · internal anchor
Data symmetries generically do not induce conserved quantities in NN training for analytic non-polynomial losses, but can for MSE with tensorizable networks.
On the fibers and semi-algebraicity of ReLU neuromanifolds math.AG · 2026-06-01 · unverdicted · none · ref 1 · internal anchor
ReLU neuromanifolds are not semi-algebraic quotients of weight spaces; honest opens are conjectured semi-algebraic and proven Zariski in the shallow case.
Singular Learning and Occam's Razor in Deep Monomial Networks cs.LG · 2026-06-26 · unverdicted · none · ref 6 · internal anchor
For large monomial activation degree, critical points in deep fully-connected networks coincide exactly with subnetwork configurations where neurons are inactive or redundant.

Most ReLU Networks Admit Identifiable Parameters

fields

years

verdicts

representative citing papers

citing papers explorer