Distilling a Neural Network Into a Soft Decision Tree
Abstract
Deep neural networks have proved to be a very effective way to perform classification tasks. They excel when the input data is high dimensional, the relationship between the input and the output is complicated, and the number of labeled training examples is large. But it is hard to explain why a learned network makes a particular classification decision on a particular test case. This is due to their reliance on distributed hierarchical representations. If we could take the knowledge acquired by the neural net and express the same knowledge in a model that relies on hierarchical decisions instead, explaining a particular decision would be much easier. We describe a way of using a trained neural net to create a type of soft decision tree that generalizes better than one learned directly from the training data.
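To make the distillation idea concrete, here is a minimal sketch of a soft decision tree trained to match a teacher network's soft predictions, in the spirit of the paper. The names (`SoftTree`, `distill_step`), the plain cross-entropy objective, and the default depth are illustrative assumptions; the paper's full recipe additionally uses an inverse temperature and a regularizer encouraging balanced use of subtrees, both omitted here. Each internal node does get its own logistic filter, packed into a single `nn.Linear`.

```python
# Minimal soft-decision-tree distillation sketch (assumed names, PyTorch).
import torch
import torch.nn as nn
import torch.nn.functional as F

class SoftTree(nn.Module):
    def __init__(self, in_dim, n_classes, depth=4):
        super().__init__()
        self.depth = depth
        n_inner = 2 ** depth - 1              # internal routing nodes
        n_leaves = 2 ** depth                 # leaves hold class distributions
        self.gates = nn.Linear(in_dim, n_inner)   # one logistic filter per node
        self.leaf_logits = nn.Parameter(torch.zeros(n_leaves, n_classes))

    def forward(self, x):
        p_right = torch.sigmoid(self.gates(x))      # (B, n_inner), P(go right)
        path = torch.ones(x.size(0), 1, device=x.device)  # start at the root
        offset = 0
        for d in range(self.depth):
            n_nodes = 2 ** d
            g = p_right[:, offset:offset + n_nodes]  # gates at this depth
            # Each current path splits into its (left, right) children.
            path = torch.stack([path * (1 - g), path * g], dim=-1)
            path = path.reshape(x.size(0), 2 * n_nodes)
            offset += n_nodes
        leaf_dist = F.softmax(self.leaf_logits, dim=-1)  # (n_leaves, C)
        return path @ leaf_dist                          # mixture over leaves

def distill_step(tree, opt, x, teacher_probs):
    # Cross-entropy between the teacher's soft targets and the tree's output.
    pred = tree(x).clamp_min(1e-9)
    loss = -(teacher_probs * pred.log()).sum(dim=-1).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()
```

A typical teacher target would be `teacher_probs = F.softmax(teacher_logits / T, dim=-1)` for some temperature `T`, so the tree learns from the network's full output distribution rather than hard labels.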
This paper has not been read by Pith yet.
Forward citations
Cited by 4 Pith papers
- TILT: Target-induced loss tilting under covariate shift
  TILT adds a target-data penalty on an auxiliary predictor component to induce effective importance weighting for unsupervised domain adaptation under covariate shift (a generic importance-weighting sketch follows this list).
- Minimax Rates and Spectral Distillation for Tree Ensembles
  Spectral analysis of tree ensembles yields minimax rates for random forests governed by kernel eigenvalue decay, and enables distillation of random forests and gradient-boosted models into compact surrogates built from leading eigenfunctions and singular vectors (see the kernel sketch below).
- Approximation-Free Differentiable Oblique Decision Trees
  DTSemNet gives an exact, invertible neural-network encoding of hard oblique decision trees that supports direct gradient training for both classification and regression, without probabilistic softening or quantized gradient estimators (a sketch of the underlying model class follows below).
- SaliencyDecor: Enhancing Neural Network Interpretability through Feature Decorrelation
  Enforcing feature decorrelation during training produces sharper saliency maps and higher accuracy on image classification benchmarks (a sketch of such a penalty follows below).
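The TILT entry attributes its effect to importance weighting under covariate shift. As a point of reference only, here is the classical density-ratio construction via a domain classifier; this is not TILT's actual objective, and `importance_weights` and its calling convention are illustrative.

```python
# Classical density-ratio importance weighting under covariate shift (sketch).
import numpy as np
from sklearn.linear_model import LogisticRegression

def importance_weights(X_source, X_target):
    # Train a domain classifier to tell source (0) from target (1) samples;
    # its odds estimate the density ratio p_target(x) / p_source(x).
    X = np.vstack([X_source, X_target])
    d = np.r_[np.zeros(len(X_source)), np.ones(len(X_target))]
    clf = LogisticRegression(max_iter=1000).fit(X, d)
    p = clf.predict_proba(X_source)[:, 1]
    # Correct for the source/target sample-size imbalance in the classifier.
    return p / (1 - p) * (len(X_source) / len(X_target))

# These weights can be passed as `sample_weight` to any source-domain loss,
# tilting it toward regions that matter under the target distribution.
```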
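For the spectral-distillation entry, the kernel it analyzes can be made concrete: a fitted random forest induces a proximity kernel (the fraction of trees in which two inputs share a leaf), and truncating that kernel's eigendecomposition gives a compact low-rank surrogate. The sketch below shows only this standard kernel view, not the paper's estimator, and the function names are illustrative.

```python
# Forest proximity kernel and low-rank spectral compression (sketch).
import numpy as np

def forest_kernel(forest, X):
    # Proximity kernel: fraction of trees in which two samples share a leaf.
    leaves = forest.apply(X)                  # (n_samples, n_trees) leaf ids
    K = np.zeros((len(X), len(X)))
    for t in range(leaves.shape[1]):
        K += leaves[:, t, None] == leaves[None, :, t]
    return K / leaves.shape[1]

def spectral_compress(K, y, rank=16):
    # Project y onto the kernel's leading eigenvectors (Nystrom-style truncation).
    vals, vecs = np.linalg.eigh(K)            # eigenvalues in ascending order
    top = vecs[:, -rank:]                     # leading eigenvectors
    return top @ (top.T @ y)                  # rank-`rank` reconstruction

# Usage idea: compress a fitted forest's own predictions,
#   smooth = spectral_compress(forest_kernel(rf, X), rf.predict(X))
```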
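The DTSemNet entry concerns gradient training of hard oblique trees; its exact invertible encoding is beyond a blurb, but the model class itself is simple. This sketch shows only what a hard oblique tree computes at inference, with heap-ordered hyperplane tests and illustrative names; it does not reproduce DTSemNet's construction.

```python
# Inference through a hard oblique decision tree (sketch, illustrative names).
import numpy as np

def oblique_tree_predict(x, W, b, leaf_values, depth):
    # W: (2**depth - 1, dim), b: (2**depth - 1,), nodes in breadth-first order.
    node = 0
    for _ in range(depth):
        go_right = float(W[node] @ x + b[node]) > 0
        node = 2 * node + (2 if go_right else 1)   # heap-style child index
    return leaf_values[node - (2 ** depth - 1)]    # map node id to leaf slot
```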
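Finally, the SaliencyDecor entry hinges on a feature-decorrelation term during training. A standard way to write such a term, offered here only as an assumption about the general idea rather than the paper's exact loss, is to penalize the off-diagonal of the batch covariance of an intermediate feature layer.

```python
# Off-diagonal covariance penalty for feature decorrelation (sketch).
import torch

def decorrelation_penalty(features):
    # features: (batch, dim) activations from some intermediate layer.
    z = features - features.mean(dim=0, keepdim=True)
    cov = (z.T @ z) / (features.size(0) - 1)       # batch covariance
    off_diag = cov - torch.diag(torch.diag(cov))   # zero out the diagonal
    return (off_diag ** 2).sum()

# total_loss = task_loss + lam * decorrelation_penalty(feats)  # lam tunable
```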