Understanding over-parameterized deep networks by geometrization

2019 · cs.LG · arXiv 1902.03793

2 Pith papers cite this work. Polarity classification is still indexing.

abstract

A complete understanding of the widely used over-parameterized deep networks is a key step for AI. In this work we try to give a geometric picture of over-parameterized deep networks using our geometrization scheme. We show that the Riemannian geometry of network complexity plays a key role in understanding the basic properties of over-parameterized deep networks, including generalization, convergence, and parameter sensitivity. We also point out that deep networks share many similarities with quantum computation systems. This can be regarded as strong support for our proposal that geometrization is not only the bible for physics but also the key idea for understanding deep learning systems.

representative citing papers

Deep network as memory space: complexity, generalization, disentangled representation and interpretability

cs.LG · 2019-07-12 · unverdicted · novelty 5.0

Deep networks are framed as memory spaces whose complexity is defined by a Fisher metric, with the least action principle linking this complexity to generalization and disentanglement for better interpretability.

Gauge theory and twins paradox of disentangled representations

cs.LG · 2019-06-24 · unverdicted · novelty 3.0

The authors propose a fibre bundle gauge theory model for disentangled representations and connect it to the twins paradox of relativity.
