Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration

Ping Liu; Yang He; Yi Yang; Zhilan Hu; Ziwei Wang

arxiv: 1811.00250 · v3 · pith:RJP6CA5Bnew · submitted 2018-11-01 · 💻 cs.CV

Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration

Yang He , Ping Liu , Ziwei Wang , Zhilan Hu , Yi Yang This is my paper

classification 💻 cs.CV

keywords filtersfpgmpruningfilternormaccuracyconvolutionalcriterion

0 comments

read the original abstract

Previous works utilized ''smaller-norm-less-important'' criterion to prune filters with smaller norm values in a convolutional neural network. In this paper, we analyze this norm-based criterion and point out that its effectiveness depends on two requirements that are not always met: (1) the norm deviation of the filters should be large; (2) the minimum norm of the filters should be small. To solve this problem, we propose a novel filter pruning method, namely Filter Pruning via Geometric Median (FPGM), to compress the model regardless of those two requirements. Unlike previous methods, FPGM compresses CNN models by pruning filters with redundancy, rather than those with ''relatively less'' importance. When applied to two image classification benchmarks, our method validates its usefulness and strengths. Notably, on CIFAR-10, FPGM reduces more than 52% FLOPs on ResNet-110 with even 2.69% relative accuracy improvement. Moreover, on ILSVRC-2012, FPGM reduces more than 42% FLOPs on ResNet-101 without top-5 accuracy drop, which has advanced the state-of-the-art. Code is publicly available on GitHub: https://github.com/he-y/filter-pruning-geometric-median

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

COP: Customized Deep Model Compression via Regularized Correlation-Based Filter-Level Pruning
cs.CV 2019-06 unverdicted novelty 6.0

COP prunes CNN filters using correlation-based importance with global normalization and dual regularization on parameter quantity and FLOPs to enable customized compression.
Vanishing Contributions: A Unified Framework for Smooth and Iterative Model Compression
cs.LG 2025-10 unverdicted novelty 5.0

VCON is a unified framework for smooth iterative DNN compression that uses parallel execution and an affine combination to progressively replace the original model with its compressed form during fine-tuning.
Deep network as memory space: complexity, generalization, disentangled representation and interpretability
cs.LG 2019-07 unverdicted novelty 5.0

Deep networks are framed as memory spaces whose complexity is defined by a Fisher metric, with the least action principle linking this complexity to generalization and disentanglement for better interpretability.
Weight Normalization based Quantization for Deep Neural Network Compression
cs.LG 2019-07 unverdicted novelty 4.0

WNQ uses weight normalization to reshape weight distributions and reduce quantization error, outperforming baselines on CIFAR-100 and ImageNet.