pith. sign in

arxiv: 1602.03616 · v2 · pith:56IV7CFMnew · submitted 2016-02-11 · 💻 cs.NE · cs.CV

Multifaceted Feature Visualization: Uncovering the Different Types of Features Learned By Each Neuron in Deep Neural Networks

classification 💻 cs.NE cs.CV
keywords neuronimagesvisualizationactivateactivationdeepdifferentfeature
0
0 comments X
read the original abstract

We can better understand deep neural networks by identifying which features each of their neurons have learned to detect. To do so, researchers have created Deep Visualization techniques including activation maximization, which synthetically generates inputs (e.g. images) that maximally activate each neuron. A limitation of current techniques is that they assume each neuron detects only one type of feature, but we know that neurons can be multifaceted, in that they fire in response to many different types of features: for example, a grocery store class neuron must activate either for rows of produce or for a storefront. Previous activation maximization techniques constructed images without regard for the multiple different facets of a neuron, creating inappropriate mixes of colors, parts of objects, scales, orientations, etc. Here, we introduce an algorithm that explicitly uncovers the multiple facets of each neuron by producing a synthetic visualization of each of the types of images that activate a neuron. We also introduce regularization methods that produce state-of-the-art results in terms of the interpretability of images obtained by activation maximization. By separately synthesizing each type of image a neuron fires in response to, the visualizations have more appropriate colors and coherent global structure. Multifaceted feature visualization thus provides a clearer and more comprehensive description of the role of each neuron.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Toy Combinatorial Interpretability Models Reveal Lottery Tickets in Early Feature Space

    cs.LG 2026-05 unverdicted novelty 7.0

    In a combinatorial toy setting, winning lottery tickets preserve families of compatible feature locations in early feature space that balance proximity to final codes with low interference, rather than specific weight...

  2. Deep Dreams Are Made of This: Visualizing Monosemantic Features in Diffusion Models

    cs.LG 2026-05 unverdicted novelty 7.0

    LVO applies optimization-based feature visualization to latent diffusion models after disentangling their representations with sparse autoencoders, yielding recognizable concept images on a fine-tuned Stable Diffusion...

  3. Towards Debugging Deep Neural Networks by Generating Speech Utterances

    cs.LG 2019-07 unverdicted novelty 5.0

    Activation maximization applied to a speech command DNN, followed by WaveNet synthesis, produces class-specific utterances that human evaluators can interpret, supporting its use for model debugging.

  4. Generative Counterfactual Introspection for Explainable Deep Learning

    cs.LG 2019-07 unverdicted novelty 5.0

    A generative-model-driven introspection method produces counterfactual image edits to explain deep neural network predictions on MNIST and CelebA.