Tensorizing Neural Networks

Alexander Novikov; Anton Osokin; Dmitry Podoprikhin; Dmitry Vetrov

arxiv: 1509.06569 · v2 · pith:EM52TR3Pnew · submitted 2015-09-22 · 💻 cs.LG · cs.NE

keywords: factor · fully-connected · networks · compression · deep · dense · layer · layers

Deep neural networks currently demonstrate state-of-the-art performance in several domains. At the same time, models of this class are very demanding in terms of computational resources. In particular, a large amount of memory is required by commonly used fully-connected layers, making it hard to use the models on low-end devices and stopping the further increase of the model size. In this paper we convert the dense weight matrices of the fully-connected layers to the Tensor Train format such that the number of parameters is reduced by a huge factor and at the same time the expressive power of the layer is preserved. In particular, for the Very Deep VGG networks we report the compression factor of the dense weight matrix of a fully-connected layer up to 200000 times leading to the compression factor of the whole network up to 7 times.
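
As a rough illustration of where the parameter savings come from, the sketch below stores a weight matrix in Tensor Train (TT) matrix form: each row and column index is split into several smaller mode indices, and each entry of the matrix is a product of small core slices. The function names, mode sizes, and TT-ranks here are hypothetical choices made for this example; they are not the configurations or the code used in the paper.

```python
import numpy as np


def tt_param_count(row_modes, col_modes, ranks):
    """Count parameters of a TT-matrix whose k-th core has shape
    (ranks[k], row_modes[k], col_modes[k], ranks[k + 1])."""
    assert len(row_modes) == len(col_modes) == len(ranks) - 1
    assert ranks[0] == 1 and ranks[-1] == 1
    return sum(
        ranks[k] * m * n * ranks[k + 1]
        for k, (m, n) in enumerate(zip(row_modes, col_modes))
    )


def random_tt_cores(row_modes, col_modes, ranks, seed=0):
    """Draw random TT cores (illustrative initialization only)."""
    rng = np.random.default_rng(seed)
    return [
        rng.standard_normal((ranks[k], m, n, ranks[k + 1]))
        for k, (m, n) in enumerate(zip(row_modes, col_modes))
    ]


def tt_to_dense(cores):
    """Contract the cores into the dense matrix of shape
    (prod(row_modes), prod(col_modes)), using C-order index flattening."""
    result = cores[0]  # shape (1, m_1, n_1, r_1)
    for core in cores[1:]:
        # result: (1, M, N, r), core: (r, m, n, r_next)
        result = np.einsum("aMNr,rmnb->aMmNnb", result, core)
        a, M, m, N, n, b = result.shape
        result = result.reshape(a, M * m, N * n, b)
    return result[0, :, :, 0]


# Hypothetical toy setting: a 64 x 64 matrix viewed as 4*4*4 by 4*4*4.
row_modes, col_modes, ranks = (4, 4, 4), (4, 4, 4), (1, 2, 2, 1)
cores = random_tt_cores(row_modes, col_modes, ranks)
dense = tt_to_dense(cores)
dense_params = dense.size
tt_params = tt_param_count(row_modes, col_modes, ranks)
print(dense.shape, dense_params, tt_params, dense_params / tt_params)
```

In this toy setting the dense matrix has 4096 entries while the TT cores hold 128 parameters. The ratio grows quickly as the matrix gets larger and the ranks stay small, which is the effect the abstract describes for fully-connected layers. A practical layer would multiply the input by the cores directly rather than rebuilding the dense matrix; `tt_to_dense` is included only to show what the cores represent.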

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Synchronous and Asynchronous Parallelism Approaches for Generalized Canonical Polyadic Tensor Decomposition with GenTen
math.NA · 2026-05 · unverdicted · novelty 6.0

Presents new synchronous and asynchronous parallel approaches for GCP tensor decomposition and evaluates computational cost and accuracy on synthetic and real-world datasets.

Vanishing Contributions: A Unified Framework for Smooth and Iterative Model Compression
cs.LG · 2025-10 · unverdicted · novelty 5.0

VCON is a unified framework for smooth iterative DNN compression that uses parallel execution and an affine combination to progressively replace the original model with its compressed form during fine-tuning.