Multifaceted Feature Visualization: Uncovering the Different Types of Features Learned By Each Neuron in Deep Neural Networks

Anh Nguyen; Jason Yosinski; Jeff Clune

arxiv: 1602.03616 · v2 · pith:56IV7CFMnew · submitted 2016-02-11 · 💻 cs.NE · cs.CV

classification 💻 cs.NE cs.CV

keywords neuron, images, visualization, activate, activation, deep, different, feature

We can better understand deep neural networks by identifying which features each of their neurons has learned to detect. To do so, researchers have created Deep Visualization techniques including activation maximization, which synthetically generates inputs (e.g., images) that maximally activate each neuron. A limitation of current techniques is that they assume each neuron detects only one type of feature, but we know that neurons can be multifaceted, in that they fire in response to many different types of features: for example, a grocery store class neuron must activate either for rows of produce or for a storefront. Previous activation maximization techniques constructed images without regard for the multiple different facets of a neuron, creating inappropriate mixes of colors, parts of objects, scales, orientations, etc. Here, we introduce an algorithm that explicitly uncovers the multiple facets of each neuron by producing a synthetic visualization of each of the types of images that activate a neuron. We also introduce regularization methods that produce state-of-the-art results in terms of the interpretability of images obtained by activation maximization. By separately synthesizing each type of image a neuron fires in response to, the visualizations have more appropriate colors and coherent global structure. Multifaceted feature visualization thus provides a clearer and more comprehensive description of the role of each neuron.
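To make the idea of activation maximization concrete, here is a minimal sketch in Python with NumPy. It is not the authors' implementation. The "neuron" is a hypothetical toy that takes the maximum over two template vectors (standing in for two facets). The L2 weight decay is a generic stand-in for the regularizers the abstract mentions. Starting one run near each facet is an illustrative assumption, since the abstract does not say how facets are separated. All function and variable names are invented for this example.

```python
import numpy as np


def neuron_activation(x, templates):
    """Toy multifaceted neuron: responds to whichever template matches x best."""
    return float(np.max(templates @ x))


def activation_gradient(x, templates):
    """Subgradient of the max: the template that currently matches best."""
    return templates[int(np.argmax(templates @ x))]


def maximize_activation(x0, templates, steps=200, lr=0.1, weight_decay=0.5):
    """Gradient ascent on activation(x) - (weight_decay / 2) * ||x||^2."""
    x = np.asarray(x0, dtype=float).copy()
    for _ in range(steps):
        # Move toward higher activation while the L2 term keeps x bounded.
        x += lr * (activation_gradient(x, templates) - weight_decay * x)
    return x


# Two hypothetical facets of one neuron (think "produce" vs. "storefront").
templates = np.array([[1.0, 0.0, 0.0],
                      [0.0, 1.0, 0.0]])

# One starting point per facet; each run settles on a different facet.
facet_a = maximize_activation([0.1, 0.0, 0.0], templates)
facet_b = maximize_activation([0.0, 0.1, 0.0], templates)
print(np.round(facet_a, 3), np.round(facet_b, 3))
```

In this toy setting, both runs reach essentially the same activation value while producing different inputs, which is the sense in which a single neuron can have several facets. A real network would replace the template maximum with a forward and backward pass through the model.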

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Toy Combinatorial Interpretability Models Reveal Lottery Tickets in Early Feature Space
cs.LG 2026-05 unverdicted novelty 7.0

In a combinatorial toy setting, winning lottery tickets preserve families of compatible feature locations in early feature space that balance proximity to final codes with low interference, rather than specific weight...

Deep Dreams Are Made of This: Visualizing Monosemantic Features in Diffusion Models
cs.LG 2026-05 unverdicted novelty 7.0

LVO applies optimization-based feature visualization to latent diffusion models after disentangling their representations with sparse autoencoders, yielding recognizable concept images on a fine-tuned Stable Diffusion...

Towards Debugging Deep Neural Networks by Generating Speech Utterances
cs.LG 2019-07 unverdicted novelty 5.0

Activation maximization applied to a speech command DNN, followed by WaveNet synthesis, produces class-specific utterances that human evaluators can interpret, supporting its use for model debugging.

Generative Counterfactual Introspection for Explainable Deep Learning
cs.LG 2019-07 unverdicted novelty 5.0

A generative-model-driven introspection method produces counterfactual image edits to explain deep neural network predictions on MNIST and CelebA.