On the importance of single directions for generalization

Ari S. Morcos; David G.T. Barrett; Matthew Botvinick; Neil C. Rabinowitz

arxiv: 1803.06959 · v4 · pith:QP7LUMWYnew · submitted 2018-03-19 · 📊 stat.ML · cs.AI· cs.LG· cs.NE

On the importance of single directions for generalization

Ari S. Morcos , David G.T. Barrett , Neil C. Rabinowitz , Matthew Botvinick This is my paper

classification 📊 stat.ML cs.AIcs.LGcs.NE

keywords networkssingleunitsacrossdatasetsdirectionsgeneralizationimportance

0 comments

read the original abstract

Despite their ability to memorize large datasets, deep neural networks often achieve good generalization performance. However, the differences between the learned solutions of networks which generalize and those which do not remain unclear. Additionally, the tuning properties of single directions (defined as the activation of a single unit or some linear combination of units in response to some input) have been highlighted, but their importance has not been evaluated. Here, we connect these lines of inquiry to demonstrate that a network's reliance on single directions is a good predictor of its generalization performance, across networks trained on datasets with different fractions of corrupted labels, across ensembles of networks trained on datasets with unmodified labels, across different hyperparameters, and over the course of training. While dropout only regularizes this quantity up to a point, batch normalization implicitly discourages single direction reliance, in part by decreasing the class selectivity of individual units. Finally, we find that class selectivity is a poor predictor of task importance, suggesting not only that networks which generalize well minimize their dependence on individual units by reducing their selectivity, but also that individually selective units may not be necessary for strong network performance.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Toy Models of Superposition
cs.LG 2022-09 accept novelty 8.0

Toy models demonstrate that polysemanticity arises when neural networks store more sparse features than neurons via superposition, producing a phase transition tied to polytope geometry and increased adversarial vulne...
Computational Lesions in Multilingual Language Models Separate Shared and Language-specific Brain Alignment
cs.CL 2026-04 unverdicted novelty 7.0

Lesioning a shared core in multilingual LLMs drops whole-brain fMRI encoding correlation by 60.32%, while language-specific lesions selectively weaken predictions only for the matched native language.
Neurons Speak in Ranges: Breaking Free from Discrete Neuronal Attribution
cs.LG 2025-02 unverdicted novelty 7.0

Neurons exhibit concept-conditioned activation ranges forming Gaussian-like distributions with minimal overlap, and range-based interventions via NeuronLens outperform neuron-level masking in targeted manipulation wit...
Multi-task Self-Supervised Learning for Human Activity Detection
cs.LG 2019-07 unverdicted novelty 6.0

A multi-task self-supervised approach trains a temporal CNN to detect transformations on sensory data, yielding features that match or exceed fully supervised performance in semi-supervised and transfer settings for s...
Single-bit-per-weight deep convolutional neural networks without batch-normalization layers for embedded systems
cs.LG 2019-07 unverdicted novelty 4.0

Experiments show that shifted-ReLU layers can replace batch-normalization in single-bit-weight wide residual networks on CIFAR-10/100 and ImageNet without consistent accuracy penalty.