A pruning-quantization-Huffman pipeline compresses deep neural networks 35-49x without accuracy loss.
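The three-stage pipeline named in the summary can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes magnitude pruning, 1-D k-means weight sharing, and Huffman coding of the cluster indices, and it omits the sparse-index storage a real deployment would also need. All function names are hypothetical.

```python
import heapq
from collections import Counter

import numpy as np

def prune(weights, threshold):
    """Magnitude pruning: zero out weights whose |value| is below threshold."""
    mask = np.abs(weights) >= threshold
    return weights * mask, mask

def kmeans_quantize(values, k, iters=10):
    """Cluster surviving weights into k shared values (simple 1-D k-means)."""
    centroids = np.linspace(values.min(), values.max(), k)
    for _ in range(iters):
        idx = np.argmin(np.abs(values[:, None] - centroids[None, :]), axis=1)
        for j in range(k):
            if np.any(idx == j):
                centroids[j] = values[idx == j].mean()
    return idx, centroids

def huffman_bits(symbols):
    """Total bits needed to Huffman-code the symbol stream.

    Uses the standard identity: total code length equals the sum of the
    merged node counts produced while building the Huffman tree."""
    counts = list(Counter(symbols).values())
    if len(counts) == 1:
        return len(symbols)  # degenerate alphabet: 1 bit per symbol
    heapq.heapify(counts)
    total = 0
    while len(counts) > 1:
        a = heapq.heappop(counts)
        b = heapq.heappop(counts)
        total += a + b
        heapq.heappush(counts, a + b)
    return total

# Toy demo on random "weights" (not a real network layer).
rng = np.random.default_rng(0)
w = rng.normal(size=1000).astype(np.float32)
pruned, mask = prune(w, threshold=0.5)
surviving = pruned[mask]
idx, centroids = kmeans_quantize(surviving, k=16)

dense_bits = w.size * 32  # original fp32 storage
# Huffman-coded cluster indices plus the fp32 codebook (index positions omitted).
compressed_bits = huffman_bits(idx.tolist()) + centroids.size * 32
print(f"compression ratio (toy): {dense_bits / compressed_bits:.1f}x")
```

The toy ratio is far from the reported 35-49x because real layers are much larger, sparser after pruning, and the paper additionally compresses the sparse index structure.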
To reduce variance, we measured the time spent on each layer over 4096 input samples and averaged the per-sample time.
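That measurement protocol, averaging one layer's wall-clock time over many inputs, can be sketched as follows; `avg_layer_time`, the dummy layer, and the inputs are hypothetical stand-ins for a real layer and dataset:

```python
import time

def avg_layer_time(layer_fn, inputs):
    """Time layer_fn on every input and return the mean wall-clock time
    per sample, which smooths out per-run timing variance."""
    start = time.perf_counter()
    for x in inputs:
        layer_fn(x)
    return (time.perf_counter() - start) / len(inputs)

# Hypothetical stand-in for one network layer: a cheap elementwise op.
dummy_layer = lambda x: [v * v for v in x]
samples = [[float(i)] * 8 for i in range(4096)]  # 4096 dummy input samples
per_sample = avg_layer_time(dummy_layer, samples)
print(f"avg time per sample: {per_sample * 1e6:.2f} us")
```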
1 Pith paper cites this work (polarity classification is still indexing).
Fields: cs.CV · Year: 2015 · Verdict: CONDITIONAL · Representative citing papers: 1
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding