Ultimate tensorization: compressing convolutional and FC layers alike

Alexander Novikov; Dmitry Podoprikhin; Dmitry Vetrov; Timur Garipov

arxiv: 1611.03214 · v1 · pith:Y5KJ3YCNnew · submitted 2016-11-10 · 💻 cs.LG

Timur Garipov, Dmitry Podoprikhin, Alexander Novikov, Dmitry Vetrov

keywords: convolutional, layers, compress, tensor, compressing, framework, fully-connected, kernel

Convolutional neural networks excel in image recognition tasks, but this comes at the cost of high computational and memory complexity. To tackle this problem, [1] developed a tensor factorization framework to compress fully-connected layers. In this paper, we focus on compressing convolutional layers. We show that while the direct application of the tensor framework [1] to the 4-dimensional kernel of convolution does compress the layer, we can do better. We reshape the convolutional kernel into a tensor of higher order and factorize it. We combine the proposed approach with the previous work to compress both convolutional and fully-connected layers of a network and achieve 80x network compression rate with 1.1% accuracy drop on the CIFAR-10 dataset.
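The reshape-then-factorize idea in the abstract can be illustrated with a minimal NumPy sketch. It assumes the factorization is a Tensor Train decomposition computed by sequential truncated SVDs, which is a common choice for this kind of compression; the abstract itself does not spell out the algorithm. The function names `tt_svd` and `tt_to_full`, the kernel shape, the reshaping into factors of 4, and the rank cap of 8 are all hypothetical choices made for illustration, not the authors' configuration.

```python
import numpy as np


def tt_svd(tensor, max_rank):
    """Factorize `tensor` into 3-way cores using sequential truncated SVDs.

    Core k has shape (r_k, n_k, r_{k+1}), with r_0 = r_d = 1.
    """
    shape = tensor.shape
    cores = []
    rank = 1
    mat = tensor
    for k in range(len(shape) - 1):
        # Unfold: rows index (previous rank, current mode), columns the rest.
        mat = mat.reshape(rank * shape[k], -1)
        u, s, vt = np.linalg.svd(mat, full_matrices=False)
        new_rank = min(max_rank, len(s))
        cores.append(u[:, :new_rank].reshape(rank, shape[k], new_rank))
        # Carry the remaining factor (singular values times V^T) forward.
        mat = s[:new_rank, None] * vt[:new_rank]
        rank = new_rank
    cores.append(mat.reshape(rank, shape[-1], 1))
    return cores


def tt_to_full(cores):
    """Contract the cores back into a dense tensor."""
    full = cores[0]
    for core in cores[1:]:
        full = np.tensordot(full, core, axes=([-1], [0]))
    return full.reshape(tuple(c.shape[1] for c in cores))


# Hypothetical 3x3 kernel with 64 input and 64 output channels.
rng = np.random.default_rng(0)
kernel = rng.standard_normal((3, 3, 64, 64))

# Reshape the 4-dimensional kernel into a 7-dimensional tensor:
# the 3x3 spatial window becomes one mode of size 9, and each
# channel dimension of size 64 is split into 4 * 4 * 4.
high_order = kernel.reshape(9, 4, 4, 4, 4, 4, 4)

cores = tt_svd(high_order, max_rank=8)
n_params = sum(core.size for core in cores)
print("dense parameters:", kernel.size)
print("factorized parameters:", n_params)
```

A random kernel like this one is not low-rank, so the truncated factorization only demonstrates the parameter count and not the accuracy. In practice the cores would be trained directly or fitted to a trained kernel, and the approximation quality depends on the chosen ranks and on how the dimensions are grouped.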

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Strategic Over-Parameterization for Generalizable Low-Rank Adaptation
cs.LG 2026-05 unverdicted novelty 5.0

LoRA-Over injects auxiliary parameters into low-rank adapters during training and decomposes them back into standard LoRA at inference, with static or dynamic scheduling to allocate extra capacity where needed, yieldi...