Exponential expressivity in deep neural networks through transient chaos

· 2016 · stat.ML · arXiv 1606.05340

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open full Pith review browse 3 citing papers arXiv PDF

abstract

We combine Riemannian geometry with the mean field theory of high dimensional chaos to study the nature of signal propagation in generic, deep neural networks with random weights. Our results reveal an order-to-chaos expressivity phase transition, with networks in the chaotic phase computing nonlinear functions whose global curvature grows exponentially with depth but not width. We prove this generic class of deep random functions cannot be efficiently computed by any shallow network, going beyond prior work restricted to the analysis of single functions. Moreover, we formalize and quantitatively demonstrate the long conjectured idea that deep networks can disentangle highly curved manifolds in input space into flat manifolds in hidden space. Our theoretical analysis of the expressive power of deep networks broadly applies to arbitrary nonlinearities, and provides a quantitative underpinning for previously abstract notions about the geometry of deep functions.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Spectral phase transitions and trainability in neural network learning dynamics

cond-mat.dis-nn · 2026-06-26 · unverdicted · novelty 6.0

SGD on neural network weights induces a BBP phase transition that detaches signal eigenvalues from the random bulk, yielding an analytically solvable phase diagram for trainability in a linear teacher-student model.

Information as Maximum-Caliber Deviation: A bridge between Integrated Information Theory and the Free Energy Principle

q-bio.NC · 2026-05-03 · unverdicted · novelty 6.0

Information defined as maximum-caliber deviation derives IIT 3.0 cause-effect repertoires from constrained entropy maximization and equates to prediction error under CLT and LDT.

Bayesian Inference with Shaped Deep Non-linear MLPs

math.ST · 2026-05-29 · unverdicted · novelty 5.0

In the LP/N = Θ(1) regime, Bayesian predictive posteriors for deep MLPs equal those of data-dependent kernels to first order, with a criterion identifying data processes that benefit from larger effective depth.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.

Exponential expressivity in deep neural networks through transient chaos

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer