pith. sign in

arxiv: 1811.09720 · v1 · pith:LGQUG5CVnew · submitted 2018-11-23 · 💻 cs.LG · stat.ML

Representer Point Selection for Explaining Deep Neural Networks

classification 💻 cs.LG stat.ML
keywords trainingnetworkpointpointsrepresentercorrespondingneuralvalues
0
0 comments X
read the original abstract

We propose to explain the predictions of a deep neural network, by pointing to the set of what we call representer points in the training set, for a given test point prediction. Specifically, we show that we can decompose the pre-activation prediction of a neural network into a linear combination of activations of training points, with the weights corresponding to what we call representer values, which thus capture the importance of that training point on the learned parameters of the network. But it provides a deeper understanding of the network than simply training point influence: with positive representer values corresponding to excitatory training points, and negative values corresponding to inhibitory points, which as we show provides considerably more insight. Our method is also much more scalable, allowing for real-time feedback in a manner not feasible with influence functions.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. PRISM: Preference-Aware Influence Function Based Data Selection Method for Efficient Fine-Tuning

    cs.LG 2026-05 unverdicted novelty 6.0

    PRISM weights target examples by the current model's preference to build a better representation for influence-function scoring of training samples in efficient LLM fine-tuning.

  2. PRISM: Preference-Aware Influence Function Based Data Selection Method for Efficient Fine-Tuning

    cs.LG 2026-05 unverdicted novelty 6.0

    PRISM weights target examples by model preference to build an improved direction for influence-based data selection in LLM fine-tuning.