pith. machine review for the scientific record. sign in

arxiv: 1412.2693 · v4 · submitted 2014-12-08 · 💻 cs.LG · cs.NE· stat.ML

Recognition: unknown

Provable Methods for Training Neural Networks with Sparse Connectivity

Hanie Sedghi , Anima Anandkumar

Authors on Pith no claims yet
classification 💻 cs.LG cs.NEstat.ML
keywords networksconnectivityneuralsparsetrainingadoptedapproachesconditions
0
0 comments X
read the original abstract

We provide novel guaranteed approaches for training feedforward neural networks with sparse connectivity. We leverage on the techniques developed previously for learning linear networks and show that they can also be effectively adopted to learn non-linear networks. We operate on the moments involving label and the score function of the input, and show that their factorization provably yields the weight matrix of the first layer of a deep network under mild conditions. In practice, the output of our method can be employed as effective initializers for gradient descent.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

    cs.LG 2024-01 unverdicted novelty 6.0

    SPIN lets weak LLMs become strong by self-generating training data from previous model versions and training to prefer human-annotated responses over its own outputs, outperforming DPO even with extra GPT-4 data on be...