Loss landscapes and optimization in over- parameterized non-linear systems and neural networks

Chaoyue Liu, Libin Zhu, Mikhail Belkin · 2022

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Flat Channels to Infinity in Neural Loss Landscapes

cs.LG · 2025-06-17 · unverdicted · novelty 7.0

Neural loss landscapes contain flat channels to infinity along which gradient flow leads pairs of neurons to implement gated linear units.

On the Convergence Theory of Pipeline Gradient-based Analog In-memory Training

cs.LG · 2024-10-19 · unverdicted · novelty 6.0

Analog-SGD-AP converges with iteration complexity O(ε^{-2} + ε^{-1}) for multi-layer DNNs on AIMC hardware despite analog weight-update imperfections and asynchronous stale gradients.

citing papers explorer

Showing 2 of 2 citing papers.

Flat Channels to Infinity in Neural Loss Landscapes cs.LG · 2025-06-17 · unverdicted · none · ref 30
Neural loss landscapes contain flat channels to infinity along which gradient flow leads pairs of neurons to implement gated linear units.
On the Convergence Theory of Pipeline Gradient-based Analog In-memory Training cs.LG · 2024-10-19 · unverdicted · none · ref 28
Analog-SGD-AP converges with iteration complexity O(ε^{-2} + ε^{-1}) for multi-layer DNNs on AIMC hardware despite analog weight-update imperfections and asynchronous stale gradients.

Loss landscapes and optimization in over- parameterized non-linear systems and neural networks

fields

years

verdicts

representative citing papers

citing papers explorer